CHAPTER 1

The Origins of Quantum Mechanics


1.1 ■ INTRODUCTION 

The story of the development of quantum mechanics has attained mythic stature 
in the history of physics. By the turn of the previous century (c. 1900), physics 
encompassed the fields of classical mechanics, electricity and magnetism, and 
thermodynamics, a set of subjects now known collectively as “classical physics.” 
Indeed, with a few minor exceptions, the concepts of classical physics seemed 
capable of explaining all known physical phenomena. In a few decades, this entire 
framework would be superseded by the development of quantum mechanics. The 
importance of this development can hardly be exaggerated. The term “modern
physics” is practically a synonym for the areas of physics that grew out of quantum
mechanics: atomic physics, nuclear physics, particle physics, and condensed
matter physics.

The predictions of quantum mechanics are rather bizarre from a classical point 
of view. Consider the following propositions, which are all postulates of classical 
physics: 

1. The physical universe is deterministic, i.e., given enough information about 
a physical system, its future evolution can be predicted exactly. Who would 
dispute this obvious point? The entire function of classical mechanics is to 
derive such predictions. 

2. Light consists of waves, while ordinary matter is composed of particles. The 
former statement is one of the triumphs of classical electromagnetism, while 
the latter seems self-evident. 

3. Physical quantities, such as energy and angular momentum, can be treated 
as continuous variables. Again, this assumption is built into the structure of 
classical mechanics. 

4. There exists an objective physical reality independent of any observer. If a 
tree falls in the woods, of course it makes a sound. 

All of these ideas seem obvious. In fact, we know from quantum mechanics that 
none of them is completely accurate: 

1. The physical universe is not deterministic. At the subatomic level, we can
assign probabilities to the outcomes of certain experiments but never predict
the exact result with certainty. Uncertainty is an intrinsic property of matter
at this level.

2. Both light and matter exhibit behavior that seems characteristic of both 
particles and waves. 

3. Under certain circumstances, some physical quantities are quantized, i.e.,
they can take on only certain discrete values.

4. Finally, it appears that the observer always affects the experiment; it is
impossible to disentangle the two.

Why would anyone believe such a preposterous set of ideas? For the only reason 
that any theory in physics is given credence: because it works. Quantum mechanics 
allows us to explain physical phenomena, primarily at very small length scales, for 
which classical physics simply offers no explanation. Physicists in the first half of 
the 20th century were themselves often hesitant to accept many of the more bizarre 
consequences of quantum mechanics, but the theory ultimately prevailed because 
it agreed with experiment. 

Because quantum mechanics is so counterintuitive, so strange in many of its 
predictions, we will begin by examining some of the experiments which led to 
its birth. In each of the experiments examined below, physicists had developed 
a classical theory which failed to account correctly for the experimental results. 
These experiments led to the idea that light could behave as both a particle and 
a wave and then to the more radical suggestion that matter also had both particle 
and wave properties. These ideas set the stage for the development of quantum 
mechanics. 


1.2 ■ BLACKBODY RADIATION 

It is one of the ironies of physics that the greatest discoveries often occur in the areas 
where one would least expect to find them. This is particularly true of quantum
mechanics, whose origins are often traced back to perhaps the least glamorous
subject in all of physics: thermodynamics. Specifically, quantum mechanics was 
launched by a solution to a nagging problem in thermodynamics, the behavior 
of blackbody radiation. Although the solution to this problem is not the most 
compelling argument for quantum mechanics, it is of such historical importance 
that we examine it in some detail. 

The Problem with Blackbody Radiation 

Blackbody radiation seems like a contradiction: objects are black precisely because
they absorb radiation, so how can a blackbody be said to emit radiation?
The confusion arises because there are two different ways that we can detect radiation
from an object: it can reflect light, or it can emit light from its own internal
energy. Almost all objects in our everyday environment give off visible light by
reflection, and it is in this sense that a black object absorbs everything and reflects
nothing. However, we are interested in the second case, the emission of radiation
from an object’s internal energy. When objects are heated to a high enough
temperature, they emit visible light. Familiar examples include the filament in an
electric light bulb or the burner element in an electric stove. The actual spectrum
of radiation produced at a given temperature will vary from one object to the next,
but a blackbody is unique in this regard: it emits radiation with equal efficiency at
all frequencies.

The reason for this lies in a concept from thermodynamics called Kirchhoff’s
law: a body at the same temperature as its surroundings will emit radiation with
the same efficiency at which it absorbs it. Imagine what would happen if this were
not the case: if the body absorbed radiation at a different rate than it emitted it, over
time the object would gradually heat up or cool down catastrophically! However,
this argument can be made stronger: a body must absorb and give off radiation at 
the same rate at every frequency. Again, imagine an object at the same temperature 
as its environment but now surrounded with a filter which allows only a narrow 
range of frequencies of radiation to pass through. Emission and absorption must be 
exactly balanced within this range of frequencies in order for the object to remain 
at the same temperature as its surroundings. 

So Kirchhoff’s law says that a good absorber of radiation will also be a good
emitter. Hence, a body which absorbs radiation with perfect efficiency at all
frequencies (a “blackbody”) must also emit radiation with perfect efficiency at all
frequencies. In practice, most things in nature are not perfectly black; they reflect
some light and are therefore not perfect blackbodies. To construct a practical
blackbody emitter, we can use a cavity with a very small hole (Figure 1.1). Then
light inside the cavity will reflect many times before it can leave the cavity. Even
if the walls of the cavity are not perfectly absorbing, the probability that the light
will escape becomes smaller with each reflection and can be made infinitesimally
small. Then the hole of the cavity acts like a blackbody, and the cavity itself is
filled with blackbody radiation.

FIGURE 1.1 A cavity with a small hole behaves like a blackbody.

The experimental properties of blackbody radiation were well established in 
the 19th century. First consider the total power given off by a blackbody. This was 
first measured by J. Stefan in 1879. The power emitted scales as the surface area 
A of the blackbody, and Stefan discovered that it also scales as the fourth power 
of the temperature T, measured in Kelvin: 

$$P = \sigma A T^4 \tag{1.1}$$

where $\sigma$ is a constant called the Stefan-Boltzmann constant, and Equation (1.1)
is called the Stefan-Boltzmann law. (Boltzmann showed how the $T^4$ dependence
could be derived theoretically.) The Stefan-Boltzmann constant is measured to be

$$\sigma = 5.67 \times 10^{-8}\ \mathrm{J\,sec^{-1}\,m^{-2}\,K^{-4}}$$


A more convenient quantity to work with is the total energy density $\rho$ of radiation
inside a blackbody cavity. In terms of the power radiated, this is given by
$\rho = (4/c)(P/A)$, where $c$ is the speed of light, so that

$$\rho = aT^4 \tag{1.2}$$

with

$$a = 7.56 \times 10^{-16}\ \mathrm{J\,m^{-3}\,K^{-4}}$$
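
The relation between the two constants can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the standard value of the speed of light (which is not quoted in this section), and the function names are hypothetical. It evaluates $4\sigma/c$ and the energy density that follows from it.

```python
# Check that the energy-density constant equals 4*sigma/c.
SIGMA = 5.67e-8   # Stefan-Boltzmann constant, J sec^-1 m^-2 K^-4 (value quoted in the text)
C = 2.998e8       # speed of light, m/sec (standard value; an assumption, not quoted here)


def radiation_constant(sigma=SIGMA, c=C):
    """Return 4*sigma/c, the constant multiplying T**4 in the energy density."""
    return 4.0 * sigma / c


def energy_density(T, sigma=SIGMA, c=C):
    """Total energy density (J m^-3) of blackbody radiation at temperature T (K)."""
    return radiation_constant(sigma, c) * T**4


a = radiation_constant()
print(f"a = {a:.3e} J m^-3 K^-4")
print(f"energy density at 2000 K = {energy_density(2000.0):.3e} J m^-3")
```

The computed constant agrees with the quoted value of 7.56 × 10^-16 J m^-3 K^-4 to the precision of the inputs.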

The spectrum of radiation, i.e., the energy density at a given frequency, can
also be measured. This spectrum is expressed as $\rho(\nu)\,d\nu$, the total energy density
in blackbody radiation between $\nu$ and $\nu + d\nu$, where $\nu$ is the frequency of the
radiation in Hz. (Of course, $\rho(\nu)\,d\nu$ will also be a function of temperature; it is
understood that the spectrum is measured at a fixed value of $T$.) A plot of the
measured spectrum is shown in Figure 1.2 for three different temperatures.

The spectrum shows two obvious features. First, the amplitude of the spectrum
increases with temperature. This is not surprising, since we know that the total
energy density must increase as $T^4$, and this total energy density is just the spectrum
integrated over all frequencies:

$$\rho = \int_0^\infty \rho(\nu)\,d\nu = aT^4$$

FIGURE 1.2 The energy density $\rho(\nu)$ (in J m$^{-3}$ Hz$^{-1}$) of blackbody radiation as a
function of the frequency $\nu$ at temperatures of $T$ = 1000 K, 1500 K, and 2000 K. The
vertical lines show the peak in the energy density at each temperature.

In Figure 1.2, the total energy density is just the area under each curve. A second
interesting feature is that the curves shift to the right (higher frequencies) as we go
to higher temperatures. More quantitatively, it is observed that the frequency of the
peak energy density (or peak emission from the blackbody) scales linearly with
the temperature: $\nu_{\rm peak} \propto T$. Since the wavelength scales inversely with frequency,
this relation is often expressed as

$$\lambda_{\rm peak} = w/T$$


where $w$ is a constant ($w = 2.90 \times 10^{-3}$ m K). This empirical result is known
as Wien’s displacement law. At room temperature, blackbodies radiate primarily
in the infrared (hence the usefulness of night vision goggles). At temperatures of
several thousands of degrees, the radiation shifts into the optical range, giving the
familiar phenomenon of a heated object first glowing “red-hot,” then “white-hot.”
Physicists of the 19th century had a theory to explain the observed behavior of the 
blackbody spectrum; the only problem was that their theory failed to produce the 
observed spectrum. 

To understand this classical theory, we need several ideas from thermodynamics,
which we will quote without derivation. Consider a collection of electromagnetic
waves inside a blackbody cavity at a temperature $T$. The energy density of the
radiation is just the average energy of the waves multiplied by their number density.
The average energy $\bar{E}$ of a set of classical oscillators is proportional to the
temperature $T$:

$$\bar{E} = kT$$

where the constant $k = 1.38 \times 10^{-23}$ J K$^{-1}$ is called Boltzmann’s constant. Classical
physics also predicts the number density of waves $n(\nu)\,d\nu$ with frequencies
between $\nu$ and $\nu + d\nu$ to be


$$n(\nu)\,d\nu = \frac{8\pi}{c^3}\,\nu^2\,d\nu \tag{1.3}$$


Then the total energy density is just the number density of waves multiplied by the 
average energy per wave: 


$$\rho(\nu) = n(\nu)\,\bar{E}$$

giving

$$\rho(\nu)\,d\nu = \frac{8\pi kT}{c^3}\,\nu^2\,d\nu \tag{1.4}$$


Equation (1.4) is called the Rayleigh-Jeans formula for blackbody radiation, and 
it is based on correct arguments from thermodynamics. The only problem is, it 
doesn’t work. In Figure 1.3, we compare the predictions of this formula to an 
actual blackbody spectrum at a temperature $T = 2000$ K.

The Rayleigh-Jeans formula is not a complete failure. At the low-frequency 
end of the spectrum, it gives good agreement with the observations. But it fails 
at the high-frequency end of the spectrum, and here it fails in a spectacular fashion.
According to Equation (1.4), the energy density of radiation should increase with
increasing frequency all of the way up to infinity! A heated object would give
off more ultraviolet light than visible light and more X-rays than either; a person 
sitting in front of a fireplace would be killed by radiation! This problem came 
to be known as the ultraviolet catastrophe. (In physics, it is common to use the 
terms “infrared” and “ultraviolet” to refer generically to the low-frequency and 
high-frequency limits of any spectrum.) 

FIGURE 1.3 The solid curve gives the observed spectrum of blackbody radiation at
$T = 2000$ K, and the dashed curve gives the prediction of the Rayleigh-Jeans formula.

It was Max Planck who found the correct formula for the blackbody radiation
spectrum, and in so doing, inadvertently developed the beginnings of quantum
mechanics. In order to understand Planck’s reasoning, we need to make use of
another result from thermodynamics. Consider a collection of interacting particles
such as gas molecules in a box or radiation in a blackbody cavity. We have
already seen that the average energy of these particles $\bar{E}$ is proportional to their
temperature: $\bar{E} = kT$. Now we need an expression for the distribution of energies
for these particles, i.e., the probability that a randomly chosen particle will have
a particular energy. If we pick a random particle in our distribution, then we will
define $P(E)\,dE$ to be the probability that it has energy between $E$ and $E + dE$.
It is observed that for a wide variety of systems, $P(E)$ has a universal form called
the Boltzmann distribution, given by

$$P(E) = \frac{e^{-E/kT}}{kT} \tag{1.5}$$

Using this expression, the mean energy of the particles is given by its classical 
value: 


$$\bar{E} = \frac{\int_0^\infty P(E)\,E\,dE}{\int_0^\infty P(E)\,dE} = kT \tag{1.6}$$


So if we blindly use the Boltzmann distribution given by Equation (1.5), we will 
still end up with the Rayleigh-Jeans formula, Equation (1.4), which fails at high 
frequencies. 

Planck’s idea was to modify the theory to make $\bar{E}$ a function of frequency in
such a way that $\bar{E} \approx kT$ at low frequencies (where the Rayleigh-Jeans formula
works well), while $\bar{E} \ll kT$ at high frequencies (where the Rayleigh-Jeans formula
fails). In order to do this, Planck assumed that $E$ could no longer take on arbitrary
values but only discrete multiples of some fundamental energy. Further, he took
this fundamental energy to be proportional to the frequency $\nu$, with a constant of
proportionality $h$. (The value of $h$ is not predicted by Planck’s theory but must be
chosen to fit the observations.) Therefore, according to Planck, the allowed values
for $E$ are simply

$$E = 0,\ h\nu,\ 2h\nu,\ 3h\nu, \ldots$$

Clearly, $h$ must have units of energy/frequency, or J sec. With this assumption,
Equation (1.6) can no longer be taken to be an integral over a continuous range of
values for $E$, but instead is a sum over the allowed discrete values:


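$$\bar{E} = \frac{\sum_{E=0,\,h\nu,\,2h\nu,\ldots} P(E)\,E}{\sum_{E=0,\,h\nu,\,2h\nu,\ldots} P(E)} = \frac{\sum_{n=0}^{\infty} \frac{nh\nu}{kT}\,e^{-nh\nu/kT}}{\sum_{n=0}^{\infty} \frac{1}{kT}\,e^{-nh\nu/kT}} = \frac{h\nu}{e^{h\nu/kT} - 1}$$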


which is obviously different from Equation (1.6). Taking this expression for E and 
multiplying by the number density of waves in Equation (1.3), we obtain Planck’s 
expression for the spectrum of blackbody radiation: 


$$\rho(\nu)\,d\nu = \frac{8\pi h}{c^3}\,\frac{\nu^3}{e^{h\nu/kT} - 1}\,d\nu \tag{1.7}$$
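
The step from the discrete sum to the closed form for the mean energy can be checked numerically. The following is a minimal sketch, not a derivation: it truncates the sum over n at an assumed cutoff (`n_terms`), takes the standard value of h as an assumption, and compares the result with hν/(e^{hν/kT} − 1). The function names are illustrative.

```python
import math

K_B = 1.38e-23   # Boltzmann's constant, J/K (value quoted in the text)
H = 6.626e-34    # Planck's constant, J sec (standard value; an assumption here)


def mean_energy_sum(nu, T, n_terms=2000):
    """Mean energy from the sum over E = n*h*nu with Boltzmann weights.

    The common factor 1/kT cancels between numerator and denominator.
    The sum is truncated at n_terms, an assumed cutoff.
    """
    x = H * nu / (K_B * T)
    weights = [math.exp(-n * x) for n in range(n_terms)]
    numerator = sum(n * H * nu * w for n, w in enumerate(weights))
    return numerator / sum(weights)


def mean_energy_closed(nu, T):
    """Closed form h*nu / (exp(h*nu/kT) - 1)."""
    return H * nu / math.expm1(H * nu / (K_B * T))


T = 2000.0
for nu in (1e11, 1e13, 1e15):
    ratio = mean_energy_closed(nu, T) / (K_B * T)
    print(f"nu = {nu:.0e} Hz: sum = {mean_energy_sum(nu, T):.4e} J, "
          f"closed form = {mean_energy_closed(nu, T):.4e} J, ratio to kT = {ratio:.3e}")
```

At the lowest frequency the mean energy is close to kT, while at the highest it is many orders of magnitude smaller, which is the behavior Planck needed.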


Example 1.1. The Classical Limit of the Planck Blackbody Spectrum. 

Show that the Planck blackbody spectrum reduces to the Rayleigh-Jeans formula 
in the limit of low frequencies. 

We begin with Equation (1.7) and assume that $h\nu/kT \ll 1$. Recall that for
small $x$,

$$e^x \approx 1 + x$$

Thus, Equation (1.7) becomes, for $h\nu/kT \ll 1$,

$$\rho(\nu)\,d\nu = \frac{8\pi h}{c^3}\,\frac{\nu^3}{1 + h\nu/kT - 1}\,d\nu = \frac{8\pi kT}{c^3}\,\nu^2\,d\nu$$

which is the Rayleigh-Jeans formula.

This result explains why the Rayleigh-Jeans formula works well at low frequencies
but fails at high frequencies. It also indicates the frequency range over
which the Rayleigh-Jeans formula works well: $\nu \ll kT/h$.
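
To get a feel for how quickly the Rayleigh-Jeans formula departs from the Planck spectrum, the ratio of the two can be evaluated at a few values of x = hν/kT. This is a rough sketch under stated assumptions: the values of h and c are the standard ones (not quoted in this example), the temperature of 2000 K is chosen to match Figure 1.3, and the function names are hypothetical.

```python
import math

K_B = 1.38e-23   # Boltzmann's constant, J/K
H = 6.626e-34    # Planck's constant, J sec (assumed standard value)
C = 2.998e8      # speed of light, m/sec (assumed standard value)


def rayleigh_jeans(nu, T):
    """Rayleigh-Jeans energy density per unit frequency, Equation (1.4)."""
    return 8.0 * math.pi * K_B * T * nu**2 / C**3


def planck(nu, T):
    """Planck energy density per unit frequency, Equation (1.7)."""
    return 8.0 * math.pi * H * nu**3 / (C**3 * math.expm1(H * nu / (K_B * T)))


def overestimate(x, T=2000.0):
    """Ratio Rayleigh-Jeans / Planck at the frequency nu = x * kT / h."""
    nu = x * K_B * T / H
    return rayleigh_jeans(nu, T) / planck(nu, T)


for x in (0.01, 0.1, 1.0, 5.0):
    print(f"h nu / kT = {x:5.2f}: Rayleigh-Jeans / Planck = {overestimate(x):.4f}")
```

The ratio reduces to (e^x − 1)/x, so the classical formula is within about 5% for x = 0.1 but is already off by a factor of about 1.7 at x = 1.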







The Planck spectrum can be used to derive both the Stefan-Boltzmann law and 
Wien’s law, and these results can be used, in turn, to calculate the value of h. The 
total energy density of blackbody radiation is simply the integral of the Planck 
spectrum over all frequencies: 

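$$\rho = \int_0^\infty \rho(\nu)\,d\nu = \int_0^\infty \frac{8\pi h}{c^3}\,\frac{\nu^3}{e^{h\nu/kT} - 1}\,d\nu$$

The integral can be put into simpler form by making the change of variables
$x = h\nu/kT$; this gives

$$\rho = \frac{8\pi k^4 T^4}{h^3 c^3} \int_0^\infty \frac{x^3}{e^x - 1}\,dx$$

The integral can be evaluated exactly:

$$\int_0^\infty \frac{x^3}{e^x - 1}\,dx = \frac{\pi^4}{15}$$

giving

$$\rho = \frac{8\pi^5 k^4}{15 h^3 c^3}\,T^4 \tag{1.8}$$

A comparison of Equation (1.8) with Equation (1.2) shows the correct $T^4$ dependence
of $\rho$.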
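
The value of the dimensionless integral, and the constant that multiplies $T^4$, can be checked numerically. The sketch below is only a sanity check: it uses a hand-written Simpson's rule with an assumed upper cutoff of x = 60 (beyond which the integrand is negligible) and modern values of k, h, and c, which are assumptions not quoted at this point in the text.

```python
import math

K_B = 1.380649e-23    # Boltzmann's constant, J/K (modern value; an assumption)
H = 6.62607015e-34    # Planck's constant, J sec (modern value; an assumption)
C = 2.99792458e8      # speed of light, m/sec


def planck_integrand(x):
    """x**3 / (exp(x) - 1); the x -> 0 limit is 0 and is handled explicitly."""
    return 0.0 if x == 0.0 else x**3 / math.expm1(x)


def simpson(f, lo, hi, n):
    """Composite Simpson's rule with n (even) subintervals."""
    if n % 2:
        raise ValueError("n must be even")
    step = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * step)
    return total * step / 3.0


# The upper limit of 60 stands in for infinity (assumed cutoff).
integral = simpson(planck_integrand, 0.0, 60.0, 6000)

# Constant multiplying T**4 in the total energy density.
a = 8.0 * math.pi * K_B**4 / (H**3 * C**3) * integral

print(f"numerical integral = {integral:.6f}, pi^4/15 = {math.pi**4 / 15.0:.6f}")
print(f"a = {a:.4e} J m^-3 K^-4")
```

The result is consistent with the measured constant in Equation (1.2), which is the comparison that allows h to be extracted from blackbody data.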

Now consider the frequency at which the maximum emission occurs. We begin
with Equation (1.7) for $\rho(\nu)$, and set $d\rho/d\nu = 0$ to find the frequency $\nu_{\rm peak}$ at which
$\rho(\nu)$ is a maximum:

$$\frac{d\rho(\nu_{\rm peak})}{d\nu} = 0 \quad\Rightarrow\quad \left(3 - \frac{h\nu_{\rm peak}}{kT}\right) e^{h\nu_{\rm peak}/kT} - 3 = 0$$

This has a trivial solution at $\nu_{\rm peak} = 0$. The nontrivial solution, which is the one
we want, cannot be calculated algebraically, but it can be found numerically:
$h\nu_{\rm peak}/kT \approx 2.8$, so

$$\nu_{\rm peak} = 2.8\,kT/h$$

This result is consistent with Wien’s law: the value of $\nu_{\rm peak}$ is proportional to the
temperature $T$.
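
The numerical solution quoted above can be reproduced with a simple root finder. Writing x = hν/kT, setting the derivative of ν³/(e^{hν/kT} − 1) to zero gives the condition (3 − x)eˣ − 3 = 0. The sketch below solves it by bisection; the bracketing interval [1, 3] and the function names are choices made here for illustration, and the value of h is the standard one (an assumption).

```python
import math

K_B = 1.38e-23   # Boltzmann's constant, J/K
H = 6.626e-34    # Planck's constant, J sec (assumed standard value)


def peak_condition(x):
    """(3 - x) * exp(x) - 3, which vanishes at the peak of the Planck spectrum."""
    return (3.0 - x) * math.exp(x) - 3.0


def bisect(f, lo, hi, tol=1e-12):
    """Find a root of f in [lo, hi] by bisection; f(lo) and f(hi) must differ in sign."""
    f_lo = f(lo)
    if f_lo * f(hi) > 0.0:
        raise ValueError("root is not bracketed")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if f_lo * f_mid <= 0.0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)


# The interval [1, 3] excludes the trivial root at x = 0.
x_peak = bisect(peak_condition, 1.0, 3.0)


def peak_frequency(T):
    """Frequency (Hz) of maximum energy density at temperature T (K)."""
    return x_peak * K_B * T / H


print(f"x_peak = {x_peak:.4f}")
print(f"peak frequency at 2000 K = {peak_frequency(2000.0):.3e} Hz")
```

The root is about 2.82, which rounds to the value 2.8 used in the text.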

Not only have we shown that the Planck spectrum predicts both the Stefan-Boltzmann
law and Wien’s law, but we have derived expressions for the corresponding
constants of proportionality in both laws. Since the speed of light $c$ is
known from other experiments, it is possible to compare these two expressions
with the experimentally observed constants of proportionality to derive values for
both $h$ and $k$. Planck did exactly that, obtaining very accurate results (in terms of
modern measurements) for both constants. The best current measurements of $k$
and $h$ give

$$k = 1.38 \times 10^{-23}\ \mathrm{J\,K^{-1}} \qquad h = 6.63 \times 10^{-34}\ \mathrm{J\,sec}$$

When these values are inserted into Equation (1.7), the result is a predicted blackbody
spectrum in excellent agreement with the observed spectrum.


1.3 ■ THE NATURE OF LIGHT

Planck showed that the spectrum of blackbody radiation could be explained only
if the energies of light waves in a blackbody cavity were restricted to discrete
values proportional to their frequency, $E = nh\nu$. Planck’s solution to the problem
of blackbody radiation clearly indicated something odd about the nature of the
radiation, but the physical interpretation of his proposal was not at all obvious.
In the decade after Planck made this proposal, several subsequent experiments
clarified what was really going on: light behaves like a gas of particles called
photons, and the energy of each photon is given by $h\nu$.

This idea hearkens back to the original theory of light proposed by Newton in 
the 1600's. In Newton's “corpuscular theory,” light consisted of particles, which 
obeyed the laws of classical mechanics. This theory could explain, for instance, the 
way in which light reflects from surfaces. However, by the 1800’s, it was clear that 
many observations could be explained only if light consisted of waves rather than 
particles. These observations included such well-known phenomena as diffraction 
and interference (Figure 1.4). Maxwell’s equations, derived from observations of 
electromagnetic phenomena, actually predict the wave nature of light in the form 
of oscillating electric and magnetic fields. By the beginning of the 20th century, the 
wave nature of light was well established. We will now consider two experiments 
that reestablished the interpretation of light as a collection of particles. 

The Photoelectric Effect 

Light shining on a metal plate is observed to produce a current; this effect is called 
the photoelectric effect (see Figure 1.5). The current and maximum energy of the
photoelectrons can be measured as the intensity and frequency of the light are
varied. [The maximum energy is measured by putting a potential across the circuit
until the current stops; if $\Phi_0$ is the potential which stops the current from flowing,
then the corresponding maximum electron energy is $E_{\rm max} = e\Phi_0$.]







FIGURE 1.4 Diffraction (left) and interference (right) indicate that light consists of 
waves. 


Here is what is observed:

1. The current is proportional to the intensity of the light (i.e., the power per
unit area falling on the plate, measured in W/m$^2$).

2. The maximum energy of the photoelectrons $E_{\rm max}$ increases linearly with the
frequency of the light (Figure 1.6). Furthermore, there is a minimum frequency
below which no current is observed; this minimum frequency depends on
the composition of the metal plate.

It is the second of these observations which is puzzling. In the classical theory of
electromagnetic radiation, the energy put into the system should be proportional to
the intensity of the light, and the frequency should play no role at all in determining
$E_{\rm max}$.

FIGURE 1.6 The maximum electron energy $E_{\rm max}$ measured in the photoelectric effect,
as a function of the frequency of light $\nu$.

It was Albert Einstein who, in 1905, came up with the explanation for the photoelectric
effect. He proposed that light, in this case, behaves as a collection of
particles called photons. Furthermore, the energy of each photon is given by $h\nu$.
When a photon strikes an electron in the metal plate, it can transfer a maximum
energy of $h\nu$ to the electron. This would suggest that $E_{\rm max} = h\nu$. However, there is
an additional complication. The electrons are bound in the metal with some binding
energy $E_B$. The photon must transfer enough energy to remove the electron
from the metal; whatever energy is left over then goes into the kinetic energy of
the electron. We therefore have

$$E_{\rm max} = h\nu - E_B$$

Although $E_B$ will vary from one metal to the next, Einstein’s theory makes one
universal prediction: the slope of the graph of $E_{\rm max}$ versus $\nu$ should be given by
Planck’s constant $h$. This is exactly what is observed. In fact, although Einstein is
most famous for the theory of relativity, he won his Nobel Prize for this explanation
of the photoelectric effect.
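
Einstein's relation is simple enough to tabulate directly. The sketch below is illustrative: the binding energy of 2.3 eV is a hypothetical value chosen only to produce numbers (it is not taken from the text), the function names are invented here, and h and e take their standard values. It computes the threshold frequency $E_B/h$, the maximum electron energy, and the corresponding stopping potential.

```python
H = 6.626e-34              # Planck's constant, J sec (assumed standard value)
EV = 1.602e-19             # one electron volt in joules (numerically e in coulombs)
E_B_ASSUMED = 2.3 * EV     # hypothetical binding energy, for illustration only


def threshold_frequency(binding_energy):
    """Minimum frequency (Hz) below which no photoelectrons are emitted."""
    return binding_energy / H


def max_electron_energy(nu, binding_energy):
    """Maximum electron energy h*nu - E_B in joules, or None below threshold."""
    energy = H * nu - binding_energy
    return energy if energy > 0.0 else None


def stopping_potential(nu, binding_energy):
    """Stopping potential in volts (E_max / e), or None below threshold."""
    energy = max_electron_energy(nu, binding_energy)
    return None if energy is None else energy / EV


print(f"threshold frequency = {threshold_frequency(E_B_ASSUMED):.3e} Hz")
for nu in (5.0e14, 1.0e15, 1.2e15):
    e_max = max_electron_energy(nu, E_B_ASSUMED)
    if e_max is None:
        print(f"nu = {nu:.1e} Hz: below threshold, no current")
    else:
        print(f"nu = {nu:.1e} Hz: E_max = {e_max / EV:.2f} eV, "
              f"stopping potential = {stopping_potential(nu, E_B_ASSUMED):.2f} V")
```

Changing the assumed binding energy shifts the threshold but leaves the slope of $E_{\rm max}$ versus $\nu$ equal to h, which is the universal prediction described above.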






FIGURE 1.7 An example of the Compton scattering experiment. Gamma rays from a 
radioactive source scatter off of electrons in a cylindrical metal target. The wavelength of 
the scattered radiation is measured as a function of the scattering angle $\theta$.


The fact that Planck’s constant appears in two very different phenomena (blackbody
radiation and the photoelectric effect) suggests that it has a fundamental physical
significance. Further, Einstein’s theory indicates something very radical about
the nature of light: it behaves as a particle with energy $h\nu$. Further confirmation
of this behavior was provided by the Compton effect.

The Compton Effect 

One of the characteristics of particles is that they can scatter off of each other, 
conserving both energy and momentum in the scattering process. If light truly does 
behave like a particle, it should be possible to observe such scattering processes 
and to predict the change in the energy and momentum of the light when it scatters. 
One such process that is observed to occur is Compton scattering, which refers to 
the scattering of X-rays or gamma rays off of the electrons in a metal (Figure 1.7). 
Experimentally, it is observed that the wavelength of the scattered radiation $\lambda_f$ is
larger than that of the incident radiation $\lambda_i$, and the change in wavelength is well
fit by the relation

$$\lambda_f - \lambda_i = \lambda_C (1 - \cos\theta) \tag{1.9}$$

where $\lambda_C$, called the Compton wavelength of the electron, is a constant with units
of length

$$\lambda_C = 2.4 \times 10^{-12}\ \mathrm{m}$$

and $\theta$ is the scattering angle shown in Figure 1.7. Note from Equation (1.9) that
the change in wavelength is actually independent of the initial wavelength $\lambda_i$; it
depends only on the scattering angle $\theta$.








FIGURE 1.8 A photon is incident with energy $E_i$ and momentum $\mathbf{p}_i$. It scatters off of an
electron, emerging at an angle $\theta$ with final energy $E_f$ and final momentum $\mathbf{p}_f$. The electron
ends up with final energy $E_e$ and final momentum $\mathbf{p}_e$.


The Compton effect cannot be explained by the classical wave theory of light. 
Classically, a light wave scattering off of an electron excites the electron to oscillate 
at the same frequency as the incident wave, and the oscillating electron produces 
radiation with the same frequency. Hence, light scattering from an electron undergoes
no change in frequency. However, if we treat the light as consisting of
particles, we will not only be able to derive the correct behavior given in
Equation (1.9), but also to obtain the correct value for $\lambda_C$.

To derive Equation (1.9), we will assume that the radiation consists of particles
with energy $h\nu$, and we will treat the Compton effect as a scattering problem in
classical mechanics (see Figure 1.8). We need to be careful to use the correct relativistic
expressions for energy and momentum here. Recall that special relativity
gives

$$E^2 = p^2 c^2 + m_0^2 c^4$$

for a particle with mass $m_0$. For the electron we simply take $m_0 = m_e$. For a
photon we know that $E = h\nu$, but what do we assume for the rest mass? Any
particle moving at the speed of light must have zero rest mass, so that $E^2 = p^2 c^2$,
and

$$E = pc$$

for photons.

Applying conservation of momentum to the system in Figure 1.8 gives 

$$\mathbf{p}_i = \mathbf{p}_f + \mathbf{p}_e \tag{1.10}$$

while energy conservation gives

$$E_i + m_e c^2 = E_f + E_e \tag{1.11}$$

where $i$ and $f$ refer to the incoming and outgoing photon, respectively, and $e$ refers
to the final state of the electron after scattering (see Figure 1.8). Note that since we
are using a fully relativistic treatment, we include the rest energy of the electron
on the left-hand side of the equation. We want to eliminate the electron energy
and momentum from both equations, so we begin by rewriting Equation (1.10)
as $\mathbf{p}_i - \mathbf{p}_f = \mathbf{p}_e$ and squaring both sides (i.e., taking the dot product of each side
with itself) to get

$$p_i^2 + p_f^2 - 2 p_i p_f \cos\theta = p_e^2 \tag{1.12}$$

Similarly, rearranging terms in Equation (1.11) and squaring, we get

$$(E_i - E_f + m_e c^2)^2 = E_e^2 \tag{1.13}$$

We now make the appropriate substitutions $E_i = p_i c$, $E_f = p_f c$, and
$E_e^2 = p_e^2 c^2 + m_e^2 c^4$ into Equation (1.13), and simplify to obtain

$$(p_i - p_f)^2 + 2 (p_i - p_f)\,m_e c = p_e^2 \tag{1.14}$$

We can now equate the left-hand sides of Equations (1.12) and (1.14), since both
right-hand sides equal $p_e^2$, and reduce the resulting equation to the form

$$\frac{1}{p_f} - \frac{1}{p_i} = \frac{1}{m_e c}(1 - \cos\theta) \tag{1.15}$$
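
For readers who want the intermediate algebra, the reduction can be written out as follows. This is a sketch of one way to carry out the step, assuming Equation (1.12) reads $p_i^2 + p_f^2 - 2p_ip_f\cos\theta = p_e^2$ and Equation (1.14) reads $(p_i - p_f)^2 + 2(p_i - p_f)m_ec = p_e^2$; the first line sets the two expressions for $p_e^2$ equal, and the last line follows on dividing by $p_i p_f m_e c$.

```latex
\begin{align*}
p_i^2 + p_f^2 - 2 p_i p_f \cos\theta
  &= (p_i - p_f)^2 + 2 (p_i - p_f)\, m_e c \\
-2 p_i p_f \cos\theta
  &= -2 p_i p_f + 2 (p_i - p_f)\, m_e c \\
p_i p_f \,(1 - \cos\theta)
  &= (p_i - p_f)\, m_e c \\
\frac{1}{p_f} - \frac{1}{p_i}
  &= \frac{1}{m_e c}\,(1 - \cos\theta)
\end{align*}
```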

At this point, we need to express the photon momenta in terms of their wavelengths.
Since we have $E = h\nu$, $\nu = c/\lambda$, and $E = pc$, we obtain

$$p = h/\lambda \tag{1.16}$$

Then Equation (1.15) reduces to

$$\lambda_f - \lambda_i = \frac{h}{m_e c}(1 - \cos\theta)$$

This is exactly the same form as Equation (1.9) with

$$\lambda_C = h/m_e c$$

Substituting the values for $h$, $m_e$, and $c$, we indeed obtain the measured value of
$\lambda_C$ ($= 2.4 \times 10^{-12}$ m). Thus, the Compton effect can be explained by assuming
that the X-rays or gamma rays act as particles with energy $E = h\nu$ and momentum
$p = h/\lambda$.
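
The substitution of constants is easy to carry out explicitly. The following sketch uses standard values of h, the electron mass, and c (assumptions, since they are not all quoted in this section) and hypothetical function names; it evaluates the Compton wavelength and the wavelength shift at a few scattering angles.

```python
import math

H = 6.626e-34     # Planck's constant, J sec (assumed standard value)
M_E = 9.109e-31   # electron mass, kg (assumed standard value)
C = 2.998e8       # speed of light, m/sec (assumed standard value)


def compton_wavelength():
    """Compton wavelength of the electron, h / (m_e c), in meters."""
    return H / (M_E * C)


def compton_shift(theta):
    """Wavelength shift (m) for a photon scattered through angle theta (radians)."""
    return compton_wavelength() * (1.0 - math.cos(theta))


print(f"Compton wavelength = {compton_wavelength():.3e} m")
for degrees in (0, 45, 90, 180):
    shift = compton_shift(math.radians(degrees))
    print(f"theta = {degrees:3d} deg: shift = {shift:.3e} m")
```

The shift vanishes for forward scattering and reaches its maximum, twice the Compton wavelength, for backscattering at 180 degrees.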

Is it a Particle or a Wave? 

Both the photoelectric effect and the Compton effect provide evidence that light
acts like a particle with energy given by $E = h\nu$. But this does not eliminate the
results of classical optics in which light behaves like a wave. Instead, we are forced
to accept the idea of wave-particle duality: light behaves sometimes as a wave and
sometimes as a particle. This provides the stepping-stone to a more radical idea:
if light can exhibit both wave and particle properties, what about matter?







1.4 ■ THE WAVE NATURE OF MATTER

We saw in Section 1.3 that light can behave as both a particle and a wave. On this 
basis, Louis de Broglie made a much more startling proposal: that matter, which 
is composed of particles, might also behave like a wave. (Louis de Broglie was 
one of the last surviving founders of quantum mechanics, dying in 1987 at the age 
of 95.) In particular, de Broglie proposed using the relation between momentum 
and wavelength appropriate for a photon (Equation 1.16) and applying it to matter. 
The de Broglie wavelength for a particle is then 


$$\lambda = h/p \tag{1.17}$$


This proposal seems absurd. If someone's body behaved like a wave, it would 
diffract every time that person walked through a door. He or she could even walk 
through a classroom wall having two doors and form an interference pattern on 
the front wall! But there is a good reason that these phenomena are not observed. 


Example 1.2. The de Broglie Wavelength of a Walking Human. 

Consider a 70 kg human being walking at 1 m sec$^{-1}$. The momentum is

$$p = mv = 70\ \mathrm{kg\,m\,sec^{-1}}$$

and Equation (1.17) gives a de Broglie wavelength of

$$\lambda = h/p = 9 \times 10^{-36}\ \mathrm{m}$$

This is a tiny wavelength compared to ordinary human length scales; in fact, it is
tiny compared to atomic or nuclear scales! Therefore, it is not surprising that we
do not see any wave-like effects at human length scales.


In order to see some effect from the wave nature of matter, it is necessary to
conduct an experiment in which $\lambda$ is significant compared to the characteristic
size of the physical system. To maximize $\lambda$, it is desirable to use the smallest
possible momentum $p$. For a nonrelativistic particle of mass $m$, the relation between
(kinetic) energy $E$ and momentum $p$ is $p = \sqrt{2mE}$. Hence, at fixed energy, $p$ will
be minimized and $\lambda$ maximized when $m$ is as small as possible. This makes the
electron (which has a much smaller mass than the proton or neutron) a convenient
particle to use. In order for the electrons to behave like waves, they must be
scattered through an “aperture” with a size comparable to or smaller than their
wavelength. This can be achieved by scattering electrons from a crystal lattice,
since the separation of the atoms in the lattice is on the order of $10^{-10}$ m.
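
These estimates can be made concrete with a short calculation. The sketch below is illustrative: the electron kinetic energy of 54 eV is an assumed value chosen to give a wavelength near the atomic spacing, the particle masses and h are standard values not quoted in this section, and the function names are hypothetical.

```python
import math

H = 6.626e-34     # Planck's constant, J sec (assumed standard value)
M_E = 9.109e-31   # electron mass, kg (assumed standard value)
M_P = 1.673e-27   # proton mass, kg (assumed standard value)
EV = 1.602e-19    # one electron volt in joules


def de_broglie_wavelength(p):
    """de Broglie wavelength h/p (m) for momentum p (kg m/sec)."""
    return H / p


def wavelength_from_energy(m, kinetic_energy):
    """Wavelength of a nonrelativistic particle, using p = sqrt(2 m E)."""
    return de_broglie_wavelength(math.sqrt(2.0 * m * kinetic_energy))


# Walking human: 70 kg at 1 m/sec.
human_wavelength = de_broglie_wavelength(70.0 * 1.0)

# Electron and proton at the same assumed kinetic energy of 54 eV.
electron_wavelength = wavelength_from_energy(M_E, 54.0 * EV)
proton_wavelength = wavelength_from_energy(M_P, 54.0 * EV)

print(f"human:    {human_wavelength:.2e} m")
print(f"electron: {electron_wavelength:.2e} m")
print(f"proton:   {proton_wavelength:.2e} m")
```

At this energy the electron wavelength is comparable to the lattice spacing of about 10^-10 m, while the proton wavelength at the same energy is smaller by a factor of roughly 40.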

In an experiment called the Davisson-Germer experiment (Figure 1.9), Davisson
and Germer scattered electrons off of a nickel sheet and measured the intensity
of the electrons as a function of the scattering angle. An interference pattern was
produced, which can be explained by taking the electrons to behave like waves
with wavelength given by Equation (1.17).

FIGURE 1.9 In the Davisson-Germer experiment, electrons are scattered from a crystal
lattice. Constructive interference occurs when the path difference between rays scattering
from adjacent planes is an integer multiple of the wavelength of the electrons.

Although we have applied the de Broglie postulate only to electrons, it was 
proposed for all matter. Thus, we are left with an interesting symmetry between 
radiation and matter: both can be considered to behave as both particles and waves. 


1.5 ■ THE BOHR ATOM

We now examine the most important result from the early stages of quantum
mechanics: Niels Bohr’s model of the atom. Ernest Rutherford’s experiments in
the scattering of alpha particles from atoms led to our familiar modern picture of
the atom: a small, positively charged, central nucleus surrounded by negatively
charged electrons orbiting the nucleus. There are, however, two problems with
this picture. According to classical electromagnetic theory, any accelerating charge
will emit electromagnetic radiation. Since the electrons must undergo centripetal
acceleration in order to orbit the nucleus, they should give off radiation. Not only
is this radiation not observed, but if it did occur, the electrons would lose energy
and spiral into the nucleus; every atom would be unstable!

The second problem wath this classical picture is that it cannot explain one of 
the most striking features of hot gasses. If hydrogen gas. for example, is heated, 
the radiation it emits is not a continuous spectrum of wavelengths. Instead, the 
radiation is confined to discrete wavelengths, producing a series of bright lines in 
the spectrum. These lines were measured in the 19th century and found to obey a 
regular pattern. For hydrogen, for example, the wavelengths at which radiation is 





emitted are given by the formula

$$ \frac{1}{\lambda} = R\left(\frac{1}{m^2} - \frac{1}{n^2}\right) \tag{1.18} $$

where m = 1, 2, 3, ... and n = 2, 3, 4, ... with n > m, and R is a constant (called
the Rydberg constant) with a value of $R = 1.097 \times 10^{7}\ \mathrm{m}^{-1}$. This was a purely
empirical relation discovered (for the special case m = 2) by Johann Balmer in
1885. (In his honor, the m = 2 series of spectral lines is called the Balmer series.)
Later, spectral lines were found for the other values of m; these are called the
Lyman series (m = 1), the Paschen series (m = 3), and so on. The Rutherford
model of the atom provides no explanation for this observed numerological result.
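
As a quick numerical illustration of the empirical formula, the following minimal sketch evaluates the first few Balmer wavelengths. It assumes the formula has the form 1/λ = R(1/m² - 1/n²) and uses the rounded value of R quoted in the text; the function name `rydberg_wavelength` is hypothetical.

```python
# Rydberg constant as quoted in the text, in inverse meters (rounded value).
R = 1.097e7

def rydberg_wavelength(m, n):
    """Wavelength in meters from 1/lambda = R (1/m^2 - 1/n^2), with n > m."""
    if n <= m:
        raise ValueError("the formula requires n > m")
    return 1.0 / (R * (1.0 / m**2 - 1.0 / n**2))

# First four Balmer lines (m = 2), converted to nanometers.
balmer_nm = [rydberg_wavelength(2, n) * 1e9 for n in range(3, 7)]
for n, wavelength in zip(range(3, 7), balmer_nm):
    print(f"n = {n}: {wavelength:.1f} nm")
```

The m = 2 lines come out in the visible range (roughly 410 to 656 nm), whereas changing the first argument to 1 or 3 moves the series into the ultraviolet or infrared.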

Niels Bohr proposed a model for the atom which solves both of these problems.
Bohr assumed that the angular momentum of the electron in a hydrogen
atom could not take on arbitrary (continuous) values but instead was quantized,
i.e., constrained to take on only discrete values, which he took to have units of
$h/2\pi$: $L = nh/2\pi$, n = 1, 2, 3, .... We now introduce a new version of Planck's
constant, $\hbar$ (pronounced “h-bar”), given by

$$ \hbar = h/2\pi $$

In terms of $\hbar$, Bohr's quantization condition becomes

$$ L = n\hbar, \qquad n = 1, 2, 3, \ldots \tag{1.19} $$

Although Bohr expressed this condition in terms of angular momentum, it follows 
immediately from de Broglie's later hypothesis of matter waves. If the electron 
behaves like a standing wave around the hydrogen nucleus, then the circumference 
of its orbit must correspond to an integer number of wavelengths (Figure 1.10). 
Taking 

$$ 2\pi r = n\lambda = nh/p $$

we get

$$ L = pr = nh/2\pi = n\hbar $$

which is just the Bohr quantization condition in Equation (1.19). 




FIGURE 1.10 The circumference of the electron orbit must correspond to an integer 
number of wavelengths of the electron. 






The rest of Bohr’s calculation is purely classical. For a classical orbit, the 
Coulomb force on the electron must be equal to the centripetal force: 


$$ F_{\text{Coulomb}} = F_{\text{centripetal}} $$

$$ \frac{1}{4\pi\epsilon_0}\frac{e^2}{r^2} = \frac{mv^2}{r} \tag{1.20} $$


Solving Equations (1.19) and (1.20) for v and r, we find that the electron orbiting 
the hydrogen atom can have only certain discrete values for its orbital radius and 
velocity, expressed in terms of the integer n. Specifically, 


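$$ r = \frac{4\pi\epsilon_0\hbar^2}{me^2}\,n^2, \qquad n = 1, 2, 3, \ldots \tag{1.21} $$

and

$$ v = \frac{e^2}{4\pi\epsilon_0\hbar}\,\frac{1}{n}, \qquad n = 1, 2, 3, \ldots \tag{1.22} $$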


The total energy of an electron with orbital radius r and velocity v is just the sum 
of its kinetic and potential energies: 


$$ E = \frac{1}{2}mv^2 - \frac{1}{4\pi\epsilon_0}\frac{e^2}{r} \tag{1.23} $$


Substituting the allowed discrete values for r and v from Equations (1.21) and
(1.22) into Equation (1.23) gives a set of discrete allowed energy levels $E_n$:

$$ E_n = -\frac{me^4}{2(4\pi\epsilon_0)^2\hbar^2}\,\frac{1}{n^2} = -13.6\ \text{eV}\left(\frac{1}{n^2}\right) \tag{1.24} $$

where we have expressed the hydrogen energy levels in units of electron-volts
(eV); 1 eV = $1.6 \times 10^{-19}$ J (Figure 1.11). The energies in Equation (1.24) are
negative because the electron is in a bound state.
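
The chain of substitutions above can be checked numerically. The sketch below is an illustration only: it assumes standard SI values for the constants (these are not quoted in the text), solves the quantization condition L = mvr = nħ together with the force balance for r and v, and then evaluates the total energy. The function name `bohr_orbit` is hypothetical.

```python
import math

# Standard SI values of the constants; assumed here, not taken from the text.
hbar = 1.054571817e-34   # reduced Planck constant, J s
m_e = 9.1093837015e-31   # electron mass, kg
e = 1.602176634e-19      # elementary charge, C
eps0 = 8.8541878128e-12  # vacuum permittivity, F/m

def bohr_orbit(n):
    """Return (radius in m, speed in m/s, total energy in eV) for level n."""
    coulomb = e**2 / (4 * math.pi * eps0)            # e^2 / (4 pi eps0), in J m
    r = hbar**2 * n**2 / (m_e * coulomb)             # allowed orbital radius
    v = coulomb / (hbar * n)                         # allowed orbital speed
    energy_joules = 0.5 * m_e * v**2 - coulomb / r   # kinetic plus potential
    return r, v, energy_joules / e

for n in (1, 2, 3):
    r, v, energy_eV = bohr_orbit(n)
    print(f"n = {n}: r = {r:.3e} m, v = {v:.3e} m/s, E = {energy_eV:.2f} eV")
```

The n = 1 energy lands close to -13.6 eV, and the energies scale as 1/n², which matches the level diagram in Figure 1.11.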

The origin of the discrete spectral lines in the Bohr model arises from the 
discrete nature of the allowed energy levels. When an electron makes a transition 
from $n_1$ to $n_2$, it gives off a photon with energy $E_{n_1} - E_{n_2}$ and frequency
$\nu = (E_{n_1} - E_{n_2})/h$. So the integers m and n which appear in Equation (1.18) have a
real physical significance: they give the final and initial energy levels, respectively, 
for the electron, as parametrized in Equation (1.24). It is straightforward to use 
Equation (1.24) to derive the wavelengths of the spectral lines given in Equation 
(1.18) (see Exercise 1.15). 
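
A small numerical cross-check of this statement, as a sketch under stated assumptions: it takes the rounded level formula E_n = -13.6 eV/n² and the rounded Rydberg constant from the text, plus standard SI values for h, c, and the electron-volt (assumed, not from the text), and compares the two routes to the wavelength of the n = 3 to n = 2 transition. The helper names are hypothetical.

```python
h = 6.62607015e-34    # Planck's constant, J s (assumed standard value)
c = 2.99792458e8      # speed of light, m/s (assumed standard value)
eV = 1.602176634e-19  # joules per electron-volt (assumed standard value)
R = 1.097e7           # Rydberg constant as quoted in the text, 1/m

def level_energy_eV(n):
    """Bohr energy level, using the rounded -13.6 eV prefactor."""
    return -13.6 / n**2

def transition_wavelength(n_initial, n_final):
    """Photon wavelength in meters for a downward transition."""
    photon_energy = (level_energy_eV(n_initial) - level_energy_eV(n_final)) * eV
    frequency = photon_energy / h
    return c / frequency

from_levels = transition_wavelength(3, 2)
from_rydberg = 1.0 / (R * (1 / 2**2 - 1 / 3**2))
print(f"{from_levels * 1e9:.1f} nm vs {from_rydberg * 1e9:.1f} nm")
```

The two numbers agree to well under one percent; the small residual difference reflects the rounding of 13.6 eV and of R, not a disagreement between the formulas.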

The Bohr model was a triumph of the early stages of quantum mechanics. Since 
the n = 1 state in Equation (1.24) has the lowest possible energy, an electron in 





[Figure 1.11 level labels: E = 0; $E_3 = -1.5$ eV; $E_2 = -3.4$ eV; $E_1 = -13.6$ eV]

FIGURE 1.11 In the Bohr model of the atom, the energy levels are given by
$E_n = -13.6\ \text{eV}\,(1/n^2)$.


this state cannot lose more energy via radiation, so the Bohr model explains the 
stability of the atom. Further, the Bohr model predicts the existence and correct 
wavelengths for the discrete lines observed in the spectrum of hydrogen. However, 
the Bohr model leaves much to be desired. It mixes classical mechanics and quantum
mechanics in a regime for which, as we shall see later, classical mechanics
does not apply at all. In particular, in the fully quantum mechanical theory of the 
atom, the electrons do not have a well-defined radius or velocity. Nonetheless, the 
full quantum mechanical theory of the hydrogen atom (discussed in Chapter 6) 
predicts the same energy levels as those given by the Bohr theory. 


1.6 ■ WHERE DO WE STAND? 

In this chapter, we have explored the development of quantum theory which took 
place up to the 1920's. In order to explain various experimental results, it was
necessary to postulate that both matter and light could behave sometimes like a 
particle and sometimes like a wave. An application of the wave nature of matter 
could then explain the discrete energy levels in the hydrogen spectrum, as well as 
the stability of the hydrogen atom. This collection of ideas did not really form a 
coherent theory, but it became the basis for a more complete quantum theory that 
began to be developed in the late 1920's, based on the Schrodinger equation. It is 
this theory of quantum mechanics which is the primary topic of the remainder of 
this book. 





EXERCISES 

1.1 Assume that a human body emits blackbody radiation at the standard body temperature.

(a) Estimate how much energy is radiated by the body in one hour. 

(b) At what wavelength does this radiation have its maximum intensity? 

1.2 A distant star is observed to have a blackbody spectrum with a maximum at a wavelength
of 3500 Å [1 Å = $10^{-10}$ m]. What is the temperature of the star?

1.3 The universe is filled with blackbody radiation at a temperature of 2.7 K left over from 
the Big Bang. [This radiation was discovered in 1965 by Bell Laboratory scientists, 
who thought at one point that they were seeing interference from pigeon droppings 
on their microwave receiver.] 

(a) What is the total energy density of this radiation? 

(b) What is the total energy density with wavelengths between 1 mm and 1.01 mm? 
Is the Rayleigh-Jeans formula a good approximation at these wavelengths?

1.4 Over what range in frequencies does the Rayleigh-Jeans formula give a result within 
10% of the Planck blackbody spectrum? 

1.5 Let $\rho(<\nu_0)$ be the total energy density of blackbody radiation in all frequencies less
than $\nu_0$, where $h\nu_0 \ll kT$. Derive an expression for $\rho(<\nu_0)$.

1.6 Suppose we want to measure the total energy density in blackbody radiation above
some cutoff frequency $\nu_0$. Let $\rho(>\nu_0)$ be the total radiation density in all frequencies
greater than $\nu_0$. Using the Planck blackbody spectrum, show that
$\rho(>\nu_0) = (8\pi/c^3)\,kT\,\nu_0^3\,e^{-h\nu_0/kT}$ is a good approximation when $h\nu_0$ is much larger than $kT$.

1.7 (a) Express the Planck spectrum (Equation 1.7) as a function of the wavelength $\lambda$ of
the radiation, rather than the frequency $\nu$.

(b) Use this expression to derive the wavelength $\lambda_{\text{peak}}$ at which the spectrum is a
maximum.

(c) Does $\lambda_{\text{peak}}\nu_{\text{peak}} = c$?

1.8 In a photoelectric experiment, electrons are emitted from a surface illuminated by 
light of wavelength 4000 Å, and the stopping potential for these electrons is found
to be $\Phi_0 = 0.5$ V. What is the longest wavelength of light that can illuminate this
surface and still produce a photoelectric current? 

1.9 A lightbulb emits 40 W of power at a wavelength of $6.0 \times 10^{-7}$ m.

(a) What is the total number of photons emitted per second?

(b) What is the energy of each photon? 

1.10 (a) Using the Planck blackbody spectrum, and the fact that a photon with a frequency 
$\nu$ has an energy of $h\nu$, derive an expression for $n(\nu)\,d\nu$, the total number density
of photons with frequencies between $\nu$ and $\nu + d\nu$ in blackbody radiation.

(b) Using the expression from part (a), show that the total number density of photons 
in blackbody radiation is given by 


$$ n = \beta\,(kT/hc)^3 $$





where $\beta$ is a constant given by $\beta \approx 60$. [Note that the integral $\int_0^\infty x^2\,dx/(e^x - 1)$
cannot be done analytically, so use the numerical result that $\int_0^\infty x^2\,dx/(e^x - 1) \approx 2.4$.]

1.11 A gamma ray with energy 1.0 MeV is scattered off of an unknown particle which is at
rest. The gamma ray is reflected directly backward with a final energy of 0.98 MeV.
What is $m_0c^2$ for the unknown particle? (Express your answer in MeV.)

1.12 Calculate the de Broglie wavelength of a proton ($mc^2$ = 938 MeV) with

(a) a kinetic energy of 0.1 MeV 

(b) a total energy of 3 GeV. 

1.13 The Balmer series (the m = 2 case in Equation 1.18) was discovered before the other 
series of spectral lines (m = 1, m = 3, etc.). Why? (Hint: Plug in some numbers and
calculate wavelengths for m = 1, m = 2, and m = 3.)

1.14 Verify that $\hbar$ has units of angular momentum.

1.15 Beginning with the Bohr energy levels (Equation 1.24), derive the expression for the
wavelengths of the spectral lines in hydrogen (Equation 1.18) and use this result to
express R as a function of m, e, $\hbar$, c, and $\epsilon_0$. Plug in values for these constants and
verify that the correct result for R is obtained.

1.16 Suppose that the attractive force between the electron and proton in the hydrogen
atom is given by some power law other than the inverse square law, i.e., assume
that the magnitude of the attractive force is given by $F = kr^{\beta}$, where k is a constant,
and $\beta$ is an arbitrary number with $\beta \neq -1$. [For example, the ordinary Coulomb law
corresponds to the case $\beta = -2$. The harmonic oscillator corresponds to $\beta = 1$.] Use
the Bohr quantization rule to show that for $\beta \neq -1$, the energy levels of the atom are
given by

$$ E_n = \left(\frac{n^2\hbar^2}{m}\right)^{(\beta+1)/(\beta+3)} k^{2/(\beta+3)} \left(\frac{1}{2} + \frac{1}{\beta+1}\right) $$

This formula gives an absurd answer when $\beta = -3$; why?



CHAPTER 


Math Interlude A: Complex 
Numbers and Linear Operators 



2.1 ■ COMPLEX NUMBERS 

Classical physics, with a few exceptions, relies on real numbers for its mathematical 
basis. Quantum mechanics marked the entry of complex numbers, in a fundamental 
way, into physics. Here we review the main properties of complex numbers for 
use in the remainder of this book. 

Consider a set of numbers 0, 1, 2, ... and some operation such as addition. The
set is said to be closed under the operation if whenever the operation is applied to
the numbers in the set, the result is also in the set. For instance, the set of nonnegative
integers is closed under addition and multiplication. However, to get a set which is closed
under subtraction, we need to include the negative numbers. Closure under division 
requires fractions, and the taking of various roots forces us to include the irrational 
numbers. 

Complex numbers arise when we try to take the square root of negative numbers. 
The square roots of the negative numbers are said to be imaginary, beginning with 
the square root of — 1: 


$$ \sqrt{-1} = i $$

It is convenient to think of the imaginary numbers as occupying a second number 
line perpendicular to the line occupied by the real numbers. The real and imaginary 
numbers can then be added just like two-dimensional vectors, resulting in the 
complex numbers which occupy this two-dimensional number plane (Figure 2.1). 
In general, an arbitrary complex number z can be written as the sum of a real 
number a and an imaginary number bi : 


$$ z = a + bi \tag{2.1} $$

In Equation (2.1), a and b are both real numbers. 

Addition and subtraction of complex numbers is easy; just as for two-dimensional
vectors, the real and imaginary parts are added or subtracted separately:

$$ (a + bi) + (c + di) = (a + c) + (b + d)i $$
$$ (a + bi) - (c + di) = (a - c) + (b - d)i $$





FIGURE 2.1 The complex numbers can be treated as two-dimensional vectors in the 
complex plane: the real part of a complex number gives the horizontal coordinate in this 
plane, and the imaginary part gives the vertical coordinate. 

Multiplication and division are more subtle. For multiplication, it is possible to 
simply multiply out two complex numbers using the distributive property: 

$$ (a + bi)(c + di) = ac + bci + adi + bd(i)(i) = (ac - bd) + (bc + ad)i $$

However, there is another way to implement complex multiplication based on a 
different way to represent complex numbers. Recall that the complex numbers form 
a two-dimensional plane, and there are two types of coordinate systems in the plane: 
Cartesian (or rectangular) coordinates and polar coordinates. The representation 
of a complex number as z = a + bi corresponds to Cartesian coordinates; we will
now derive a polar representation. 

To do this, first consider exponentials of imaginary numbers. For an imaginary
number $i\theta$ (where $\theta$ is real), we can use the Taylor expansion of the exponential
to give

$$ e^{i\theta} = 1 + (i\theta) + \frac{1}{2}(i\theta)^2 + \frac{1}{6}(i\theta)^3 + \frac{1}{24}(i\theta)^4 + \frac{1}{120}(i\theta)^5 + \cdots $$

The terms on the right-hand side of this equation that are even powers of $(i\theta)$ will
give real numbers, while the odd powers will give imaginary numbers. Collecting 





[Figure 2.2 axis labels: real axis, imaginary axis]

FIGURE 2.2 Since $e^{i\theta} = \cos\theta + i\sin\theta$, the complex number $e^{i\theta}$ has unit absolute value
and lies at an angle $\theta$ relative to the real axis.


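the even and odd powers separately, and factoring out i from the latter, gives

$$ e^{i\theta} = \left(1 - \frac{\theta^2}{2} + \frac{\theta^4}{24} - \cdots\right) + i\left(\theta - \frac{\theta^3}{6} + \frac{\theta^5}{120} - \cdots\right) $$

But the two sums on the right-hand side of this equation are the Taylor expansions
for cosine and sine, respectively. Hence, this equation can be written as

$$ e^{i\theta} = \cos\theta + i\sin\theta \tag{2.2} $$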

As $\theta$ increases from 0 to $2\pi$, the function $e^{i\theta}$ traces out a unit circle in the complex
plane with angle $\theta$ relative to the positive real axis (Figure 2.2). Multiplying by the
real number R then gives a complex number at a distance R from the origin and
at an angle $\theta$ relative to the positive real axis (Figure 2.3), written as $Re^{i\theta}$. This
gives the polar representation of a complex number. Any complex number can be
written in either Cartesian or polar form. When a complex number z is written in
the form $z = Re^{i\theta}$, then R is called the modulus or the absolute value of z, and $\theta$
is called the argument of z. Note that $\theta$ must be expressed in radians. Using the
standard notation for absolute value, we can write |z| = R.
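
The polar representation can be explored numerically with Python's standard `cmath` module. This is a minimal sketch; the helper name `polar_point` and the sample values of R and θ are illustrative choices, not anything from the text.

```python
import cmath
import math

def polar_point(R, theta):
    """Complex number at distance R from the origin and angle theta (radians)."""
    return R * cmath.exp(1j * theta)

theta = 0.75  # radians, arbitrary sample angle

# Euler's relation: e^{i theta} should equal cos(theta) + i sin(theta).
lhs = cmath.exp(1j * theta)
rhs = complex(math.cos(theta), math.sin(theta))
print(abs(lhs - rhs))           # difference at the level of rounding error

# Modulus and argument are recovered from the polar form.
z = polar_point(3.0, theta)
print(abs(z), cmath.phase(z))   # modulus R and argument theta
```

Note that `cmath.phase` reports the argument in the range (-π, π], so angles in the lower half-plane come back negative rather than in the range π to 2π.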







[Figure 2.3 axis label: imaginary axis]

FIGURE 2.3 The complex number $z = Re^{i\theta}$ is located at a distance R from the origin
and at an angle $\theta$ relative to the positive real axis.


To convert from the Cartesian form of a complex number, a + bi, to polar form,
$Re^{i\theta}$, and vice versa, we use Equation (2.2) to give

$$ Re^{i\theta} = R\cos\theta + iR\sin\theta $$

so that

$$ a = R\cos\theta $$
$$ b = R\sin\theta $$

and conversely,

$$ R = \sqrt{a^2 + b^2} $$
$$ \theta = \tan^{-1}(b/a) $$

Note that there is a subtlety in determining the value of $\theta$ for a given a and b, because
$\tan^{-1}(x)$ actually has two different values for a given choice of x, separated from
each other by the angle $\pi$. This ambiguity is resolved by noting that $\theta$ must be
chosen so that the complex number lies in the correct quadrant of the complex





plane: 


$0 < \theta < \pi/2$        a > 0, b > 0
$\pi/2 < \theta < \pi$      a < 0, b > 0
$\pi < \theta < 3\pi/2$     a < 0, b < 0
$3\pi/2 < \theta < 2\pi$    a > 0, b < 0


Example 2.1. Converting a Complex Number to Polar Form. 

Express 2 + 2i and 1 - i in polar form.

For 2 + 2i, we have

$$ R = \sqrt{2^2 + 2^2} = 2\sqrt{2} $$

and

$$ \theta = \tan^{-1}(2/2) = \tan^{-1}(1) = \pi/4 \text{ or } 5\pi/4 $$

Since a > 0 and b > 0, the correct choice for $\theta$ is $\pi/4$. Similarly, for 1 - i, we get

$$ R = \sqrt{1^2 + (-1)^2} = \sqrt{2} $$

and

$$ \theta = \tan^{-1}(-1/1) = \tan^{-1}(-1) = 3\pi/4 \text{ or } 7\pi/4 $$

and since a > 0 and b < 0, we must choose $7\pi/4$. Hence, we have

$$ 2 + 2i = 2\sqrt{2}\,e^{i\pi/4} $$

and

$$ 1 - i = \sqrt{2}\,e^{i7\pi/4} $$
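
In code, the quadrant bookkeeping is usually delegated to a two-argument arctangent. The sketch below is one way to do the conversion, assuming the convention that θ lies between 0 and 2π; the function name `to_polar` is hypothetical.

```python
import math

def to_polar(a, b):
    """Return (R, theta) for a + bi, with theta in the range [0, 2*pi)."""
    R = math.hypot(a, b)                        # sqrt(a^2 + b^2)
    # atan2 uses the signs of both arguments to pick the quadrant and
    # returns an angle in (-pi, pi]; the modulo shifts it into [0, 2*pi).
    theta = math.atan2(b, a) % (2 * math.pi)
    return R, theta

for a, b in [(2, 2), (1, -1)]:
    R, theta = to_polar(a, b)
    print(f"{a} + ({b})i: R = {R:.4f}, theta = {theta / math.pi:.2f} pi")
```

For the two numbers of Example 2.1 this reproduces the choices π/4 and 7π/4 made by inspecting the signs of a and b.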


It is now straightforward to multiply or divide two complex numbers, represented
as $z_1 = R_1e^{i\theta_1}$ and $z_2 = R_2e^{i\theta_2}$. For multiplication,

$$ z_1z_2 = (R_1e^{i\theta_1})(R_2e^{i\theta_2}) = R_1R_2e^{i(\theta_1+\theta_2)} $$

Thus, when multiplying two complex numbers, the R's are multiplied and the
$\theta$'s are added. For example, multiplying a complex number by i simply rotates
it 90° in the complex plane without changing its distance from the origin, while
multiplication by -1 is equivalent to rotation through 180°. Similarly, for division,

$$ \frac{z_1}{z_2} = \frac{R_1e^{i\theta_1}}{R_2e^{i\theta_2}} = \frac{R_1}{R_2}\,e^{i(\theta_1-\theta_2)} $$




28 


Chapter 2 Math Interlude A: Complex Numbers and Linear Operators 


Example 2.2. Multiplication of Complex Numbers. 

What is (2 + 2i)(1 - i)?

This can be solved in two different ways. Using the Cartesian form for these
numbers,

$$ (2 + 2i)(1 - i) = (2)(1) + (2i)(1) + (2)(-i) + (2i)(-i) = 2 + 2i - 2i + 2 = 4 $$

In polar form, we use the results from Example 2.1:

$$ 2 + 2i = 2\sqrt{2}\,e^{i\pi/4} $$

and

$$ 1 - i = \sqrt{2}\,e^{i7\pi/4} $$

Then

$$ (2 + 2i)(1 - i) = (2\sqrt{2}\,e^{i\pi/4})(\sqrt{2}\,e^{i7\pi/4}) = 4e^{i2\pi} = 4 $$

Of course, the final answer cannot depend on whether we perform the multiplication
in Cartesian or polar form.

When a complex number is expressed in polar form, it is straightforward to take
it to an arbitrary power:

$$ (Re^{i\theta})^n = R^ne^{in\theta} $$

and the roots of a complex number can be determined in a similar way:

$$ \sqrt[n]{Re^{i\theta}} = (Re^{i\theta})^{1/n} = \sqrt[n]{R}\,e^{i\theta/n} $$


Example 2.3. The Cube Root of 1. 

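What is the cube root of 1?

In terms of real numbers, the cube root of 1 is just 1. However, when we consider
complex numbers, we discover that 1 actually has three different cube roots! We
have in polar form:

$$ 1 = e^{i0} = e^{i2\pi} = e^{i4\pi} $$

Taking the cube root of each of these forms gives the three values $e^{i0} = 1$, $e^{i2\pi/3}$, and $e^{i4\pi/3}$.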







In the complex plane, these three numbers all lie on a unit circle separated from
each other by an angle of $2\pi/3$ (or 120°).
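
The same idea extends to any root of any complex number: adding multiples of 2π to the argument before dividing by n generates all n distinct roots. The following sketch illustrates this; the function name `nth_roots` is hypothetical.

```python
import cmath
import math

def nth_roots(R, theta, n):
    """All n distinct n-th roots of R e^{i theta}, for R > 0."""
    # The same number can be written with argument theta + 2*pi*k for any
    # integer k; dividing by n gives a distinct root for k = 0, ..., n - 1.
    return [R ** (1.0 / n) * cmath.exp(1j * (theta + 2 * math.pi * k) / n)
            for k in range(n)]

cube_roots = nth_roots(1.0, 0.0, 3)
for w in cube_roots:
    print(w, abs(w**3 - 1))   # each root cubes back to 1 within rounding
```

Consecutive roots differ in argument by 2π/n, so they sit at the corners of a regular n-sided polygon on a circle of radius R^(1/n).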


Finally, we must deal with one operation that is unique to complex numbers. 
This is called complex conjugation. If z = a + bi is an arbitrary complex number, 
then its complex conjugate, written as $\bar{z}$ or z* (we will use the latter notation), is
given by

$$ z^* = a - bi $$

This corresponds to reflection in the complex plane through the real axis (Figure 2.4).

Some important properties of complex conjugation are

$$ (z^*)^* = z $$
$$ (w + z)^* = w^* + z^* $$
$$ (wz)^* = w^*z^* $$
$$ zz^* = |z|^2 $$



[Figure 2.4 axis label: imaginary axis]

FIGURE 2.4 If z = a + bi, then z* = a - bi, corresponding to reflection through the
real axis.






where w and z are any two complex numbers. In polar form, we have 

$$ (Re^{i\theta})^* = Re^{-i\theta} $$

More generally, the complex conjugate of a complicated expression can be obtained
simply by changing i to -i everywhere in the expression, e.g.,

$$ (1 + i + e^{ix})^* = 1 - i + e^{-ix} $$
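
These conjugation rules are easy to spot-check with Python's built-in complex type. The sketch below uses arbitrary sample values for R, θ, and x, and the helper name `conj_polar` is hypothetical.

```python
import cmath

def conj_polar(R, theta):
    """Conjugate of R e^{i theta}, computed with the built-in conjugate()."""
    return (R * cmath.exp(1j * theta)).conjugate()

R, theta, x = 2.5, 1.1, 0.4   # arbitrary sample values

# Polar rule: conjugation flips the sign of the argument.
polar_gap = abs(conj_polar(R, theta) - R * cmath.exp(-1j * theta))

# "Change i to -i everywhere" applied to 1 + i + e^{ix}.
expr = 1 + 1j + cmath.exp(1j * x)
expr_gap = abs(expr.conjugate() - (1 - 1j + cmath.exp(-1j * x)))

print(polar_gap, expr_gap)    # both at the level of rounding error
```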


2.2 ■ OPERATORS 

Definition of an Operator 

The idea of a function is very familiar: a function is simply a fixed rule for taking a 
number and changing it into another number. A number is plugged into a function 
and out comes a different number. An operator is a rule for changing one function 
into another function (Figure 2.5). 

A familiar example of an operator is the derivative operator D, which takes
an input function f(x) and produces as its output the derivative of that function,
df/dx:

$$ D[f(x)] = \frac{df}{dx} $$

e.g.,

$$ D[x^2] = 2x $$
$$ D[\sin x] = \cos x $$

and so on. We will be interested only in a special class of operators called linear
operators. In order for an operator L to be a linear operator, it must satisfy two
properties: first, for every pair of functions f(x) and g(x),

$$ L[f(x) + g(x)] = L[f(x)] + L[g(x)] $$


[Figure 2.5 panels. Function (number in, number out): $f(x) = x^3$, with f(1) = 1, f(2) = 8, f(3) = 27. Operator (function in, function out): $D[g(x)] = dg/dx$, with $D[x^2] = 2x$, $D[\sin x] = \cos x$, $D[\ln x] = 1/x$.]


FIGURE 2.5 A function is a rule for taking a number and turning it into a different 
number. An operator is a rule for taking a function and turning it into a different function. 






and second, for every function f(x) and real or complex number c, 

$$ L[cf(x)] = cL[f(x)] $$


Example 2.4. Determining Whether Operators are Linear. 

Determine whether or not the following operators are linear: 

(a) $A[g(x)] = g(x)^2$,

(b) the derivative operator, $D[g(x)] = dg/dx$.

(a) Note that

$$ A[f(x) + g(x)] = [f(x) + g(x)]^2 = f(x)^2 + g(x)^2 + 2f(x)g(x) $$

and

$$ A[f(x)] + A[g(x)] = f(x)^2 + g(x)^2 $$

Thus, $A[f(x) + g(x)] \neq A[f(x)] + A[g(x)]$, so A is not a linear operator.

(b) For the derivative operator, we have

$$ D[f(x) + g(x)] = \frac{d}{dx}[f(x) + g(x)] = \frac{df}{dx} + \frac{dg}{dx} = D[f(x)] + D[g(x)] $$

and

$$ D[cf(x)] = \frac{d}{dx}[cf(x)] = c\frac{df}{dx} = cD[f(x)] $$

So the derivative operator is a linear operator. 
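
The first linearity property can also be probed numerically by representing an operator as a function that takes a function and returns a function. This is a rough sketch, not a proof: it checks additivity only at a handful of sample points, approximates the derivative by a central difference, and all names in it are hypothetical.

```python
import math

def square_op(g):
    """The operator A[g(x)] = g(x)^2."""
    return lambda x: g(x) ** 2

def derivative_op(g, h=1e-5):
    """Central-difference approximation to the derivative operator D."""
    return lambda x: (g(x + h) - g(x - h)) / (2 * h)

def additivity_gap(op, f, g, points):
    """Largest |op[f + g] - (op[f] + op[g])| over the sample points."""
    total = lambda x: f(x) + g(x)
    return max(abs(op(total)(x) - (op(f)(x) + op(g)(x))) for x in points)

points = [0.1 * k for k in range(1, 20)]
gap_square = additivity_gap(square_op, math.sin, math.exp, points)
gap_derivative = additivity_gap(derivative_op, math.sin, math.exp, points)
print(gap_square, gap_derivative)
```

The squaring operator shows a large gap (the cross term 2f(x)g(x)), while the derivative operator's gap is at the level of rounding error. The second property, L[cf] = cL[f], could be tested in the same way.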


Eigenfunctions and Eigenvalues 

Suppose that for a particular linear operator L, we can find a function f(x) which
has the property

$$ L[f(x)] = cf(x) $$

where c can be a real or a complex number. In other words, applying L to the
function f simply gives us f back again multiplied by the number c. In this
case, we say that f is an eigenfunction of L with eigenvalue c. The actual set of
eigenfunctions, along with their corresponding eigenvalues, will depend on L. 


Example 2.5. The Eigenfunctions and Eigenvalues of the Derivative 
Operator. 

If g is an eigenfunction of the derivative operator D, it must satisfy

$$ D[g(x)] = \frac{dg}{dx} = cg(x) \tag{2.3} $$







This differential equation has the general solution $g(x) = Ae^{cx}$, where A is an
arbitrary constant, and c is the eigenvalue appearing in Equation (2.3). Hence,
$g(x) = Ae^{cx}$ is the most general possible eigenfunction of D with eigenvalue c.
Note that in this case, any complex number can be an eigenvalue of D. 
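
As a numerical illustration, the ratio D[g]/g should come out equal to c at every point when g is an exponential, even for complex c. The sketch below approximates the derivative by a central difference; the sample values of A and c and the helper name `derivative` are arbitrary choices made for the illustration.

```python
import cmath

def derivative(g, x, h=1e-6):
    """Central-difference approximation to dg/dx at the point x."""
    return (g(x + h) - g(x - h)) / (2 * h)

A = 2.0 - 1.0j   # arbitrary complex amplitude
c = 0.5 + 3.0j   # arbitrary complex eigenvalue

def g(x):
    return A * cmath.exp(c * x)

# For an eigenfunction, D[g](x) / g(x) is the same number at every x.
ratios = [derivative(g, x) / g(x) for x in (-1.0, 0.0, 0.7, 2.0)]
print(ratios)    # each entry is close to c
```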

Of course, most functions are not eigenfunctions of a given operator. For example,
in the case of the derivative operator, it is obvious that

$$ D[\sin(x)] = \cos(x) \neq c\sin(x) $$
$$ D[x^2] = 2x \neq cx^2 $$
$$ D[\ln(x)] = 1/x \neq c\ln(x) $$

Thus, an eigenfunction is a very special sort of function. 


Example 2.6. The Eigenfunctions and Eigenvalues of the One-Dimensional 
Parity Operator. 

Find the eigenfunctions and eigenvalues of the one-dimensional parity operator,
$\Pi$, defined by

$$ \Pi[g(x)] = g(-x) $$

i.e., the parity operator reflects the function g(x) through x = 0 (Figure 2.6).

Again, we must solve the equation $\Pi[g(x)] = cg(x)$. Using the definition of
the parity operator, we get

$$ \Pi[g(x)] = g(-x) = cg(x) \tag{2.4} $$

This equation has no obvious solution, so we apply $\Pi$ twice to the function g(x):

$$ \Pi^2[g(x)] = \Pi[g(-x)] = g(x) \tag{2.5} $$



FIGURE 2.6 The one-dimensional parity operator $\Pi$ reflects the function g(x) through
x = 0.







(Note that the notation $L^n[g(x)]$ means to apply the operator n times to the function
g(x), e.g., $L^3[g(x)] = L[L[L[g(x)]]]$.) If we assume that g(x) is an eigenfunction
of $\Pi$ with eigenvalue c, we also have

$$ \Pi^2[g(x)] = \Pi[cg(x)] = c\Pi[g(x)] = c^2g(x) \tag{2.6} $$

Combining Equations (2.5) and (2.6), we get

$$ g(x) = c^2g(x) $$

which has the solutions $c = \pm 1$. Note that in contrast to Example 2.5, there are
not an infinite number of eigenvalues but only two discrete eigenvalues. We can
now determine what functions correspond to each eigenvalue. For c = +1, Equation
(2.4) gives g(-x) = g(x), so that g(x) is an arbitrary even function. For c = -1,
Equation (2.4) yields g(-x) = -g(x), so g(x) is an arbitrary odd function.

Note that when an eigenfunction of a linear operator is multiplied by an arbitrary 
constant, it remains an eigenfunction with the same eigenvalue (Exercise 2.11).
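
To make the parity eigenfunctions concrete, the sketch below splits a function into its even and odd parts and checks that the parity operator returns each part multiplied by +1 or -1. The even/odd decomposition is a standard construction that is not stated in the text, and the function names are hypothetical.

```python
import math

def parity(g):
    """The parity operator: returns the function x -> g(-x)."""
    return lambda x: g(-x)

def even_part(g):
    """Even part of g, an eigenfunction of parity with eigenvalue +1."""
    return lambda x: 0.5 * (g(x) + g(-x))

def odd_part(g):
    """Odd part of g, an eigenfunction of parity with eigenvalue -1."""
    return lambda x: 0.5 * (g(x) - g(-x))

g = math.exp     # exp(x) itself is neither even nor odd
points = [-2.0, -0.5, 0.3, 1.7]

even_ok = all(math.isclose(parity(even_part(g))(x), even_part(g)(x)) for x in points)
odd_ok = all(math.isclose(parity(odd_part(g))(x), -odd_part(g)(x)) for x in points)
print(even_ok, odd_ok)
```

For g(x) = e^x the even and odd parts are cosh x and sinh x, so a function that is not itself an eigenfunction of parity can still be written as a sum of two eigenfunctions.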


EXERCISES 

2.1 Evaluate all of the following, and express all of your final answers in the form a + bi:

(a) $i(2 - 3i)(3 + 5i)$

(b) $i/(i - 1)$

(c) $(1 + i)^{30}$

2.2 In the complex plane, there are 5 different fifth roots of 1. Determine the five values
for $\sqrt[5]{1}$, and express them in polar form.

2.3 Suppose that $z = 1 + e^{i\theta}$. Calculate $z^*$, $z^2$, and $|z|^2$. Your expression for $|z|^2$ should
not contain any imaginary numbers.

2.4 Suppose that a complex number z has the property that $z^* = z$. What does this indicate
about z?

2.5 Reduce $i^i$ to a real number.

2.6 What is wrong with the following argument? 

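$$ \sqrt{\frac{1}{-1}} = \sqrt{\frac{-1}{1}} $$

$$ \frac{\sqrt{1}}{\sqrt{-1}} = \frac{\sqrt{-1}}{\sqrt{1}} $$

$$ \frac{1}{i} = \frac{i}{1} $$

Therefore,

$$ (i)(i) = 1 $$
$$ -1 = 1 $$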





2.7 Determine which of the following are linear operators, and which are not. 

(a) The parity operator $\Pi[f(x)] = f(-x)$.

(b) The translation operator T[f(x)] = f(x + 1).

(c) The operator L[f(x)] = f(x) + 1.

2.8 Consider the identity operator I, defined by I[f(x)] = f(x).

(a) Show that I is a linear operator.

(b) Find the eigenfunctions and corresponding eigenvalues of I.

2.9 Suppose that the function f(x) is an eigenfunction of the linear operator P with
eigenvalue p, and f(x) is also an eigenfunction of the linear operator Q with eigenvalue
q. Show that PQ[f(x)] = QP[f(x)], where PQ[f(x)] means to first apply
the operator Q to f(x), and then apply P to the result.

2.10 Consider the square of the derivative operator $D^2$.

(a) Show that $D^2$ is a linear operator.

(b) Find the eigenfunctions and corresponding eigenvalues of $D^2$.

(c) Give an example of an eigenfunction of $D^2$ which is not an eigenfunction of D.

2.11 Let f(x) be an eigenfunction of a linear operator L with eigenvalue a. Show that
cf(x) (where c is a constant) is an eigenfunction of L with eigenvalue a.

2.12 Consider the following operator L: 


$$ L[f(x)] = \int_0^x f(s)\,ds $$


(a) Show that L is a linear operator. 

(b) Find the eigenfunctions of L, or show that L has no eigenfunctions. 



CHAPTER



The Schrodinger Equation 


The Schrodinger equation, developed by Erwin Schrodinger and published in 1926,
forms the basis of modern quantum mechanics. Indeed, it is one of the most important
equations in all of physics, and much of the remainder of this book will be
based on it. 

The way in which the Schrodinger equation describes the behavior of particles is 
fundamentally different from the corresponding description in classical mechanics. 
In classical mechanics, a particle has a fixed position in three-dimensional space, 
given by the vector r. This position is a function of time, t, so a complete description 
of the motion of the particle is given by its trajectory, r(t); the main problem in
classical mechanics is to determine r(t).

In quantum mechanics, in contrast, a particle no longer has a definite trajectory
r(t). Instead, we start with the idea from Chapter 1 that matter can be treated
as a wave. In particular, we will assume that any particle can be described by a
wave function, $\Psi(\mathbf{r}, t)$, which gives the amplitude of the wave as a function of the
three-dimensional position in space, r, and of the time, t.

This leads to an obvious question: what is the physical meaning of $\Psi(\mathbf{r}, t)$,
i.e., what does it tell us about the particle? Although we must abandon the hope
of determining the position of the particle as a function of time, what we can
derive from the wave function is the probability that the particle will be found in
a given region of space at a given time. Furthermore, all of the other observable
physical characteristics of this particle (e.g., its momentum, energy, etc.) are related
to $\Psi(\mathbf{r}, t)$. The relationship between observable quantities and $\Psi(\mathbf{r}, t)$ will be
examined in more detail in Section 3.2. First, however, we will derive the equation
which determines $\Psi(\mathbf{r}, t)$: the Schrodinger equation.


3.1 ■ DERIVATION OF THE SCHRODINGER EQUATION 

Unfortunately, it is no more possible to “derive” the Schrodinger equation from
purely mathematical arguments than it is to derive Newton’s law of gravitation
or F = ma. The only basis for developing a physical theory is that it describes
experimental reality. What we can do, however, is make some reasonable assumptions
and show that these lead to the Schrodinger equation. We will then solve the
Schrodinger equation and discover that it does, indeed, make numerous correct 
predictions. In particular, it provides an accurate description of the hydrogen atom 
and many other phenomena at the atomic and subatomic scales. 





[Figure 3.1 panel label: fix x, vary t]

FIGURE 3.1 The equation $\Psi(x, t) = A\cos(2\pi x/\lambda - 2\pi\nu t)$ represents a wave with amplitude
A and wavelength $\lambda$, oscillating in time with frequency $\nu$.


For simplicity, consider first a wave in one dimension, travelling in the +x
direction. A wave with frequency $\nu$ and wavelength $\lambda$ can be written in the form

$$ \Psi(x, t) = A\cos(2\pi x/\lambda - 2\pi\nu t) \tag{3.1} $$

What does Equation (3.1) mean? The shape of the wave can be derived by fixing
t and treating $\Psi$ as a function only of x. Then Equation (3.1) represents a wave,
frozen in time, which oscillates sinusoidally as a function of x with wavelength
$\lambda$ and amplitude A. Alternatively, we can fix the position x on the wave to be a
constant and consider how $\Psi$ varies with t; in this case, we see that our fixed point
on the wave oscillates up and down sinusoidally with frequency $\nu$ and amplitude
A (Figure 3.1). To simplify this wave equation, it is conventional to make the
substitutions: 


$$ k = 2\pi/\lambda $$





and 


$$ \omega = 2\pi\nu $$

where k is called the wave number, and $\omega$ is the angular frequency. In terms of k
and $\omega$, Equation (3.1) simplifies to

$$ \Psi = A\cos(kx - \omega t) \tag{3.2} $$

We note one other property of the wave described by Equation (3.2) (or Equation
3.1); it represents a wave travelling to the right with velocity $\omega/k$. To see this,
consider, for example, the maximum $x_{\max}$ in the amplitude of the wave at x = 0,
t = 0. At some later time t, this will still be a maximum of the cosine function as
long as $kx_{\max} - \omega t = 0$, or

$$ x_{\max} = (\omega/k)t \tag{3.3} $$

Equation (3.3) shows that the maximum moves in the +x direction, i.e., to the
right, with velocity $\omega/k$. This same argument can be made about any other point
on the wave, which means that the entire wave moves in the +x direction with
velocity

$$ v = \omega/k $$

This velocity is called the phase velocity (Figure 3.2). 
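
The motion of the crest can be followed numerically. The sketch below samples the wave on a fine grid at a later time and locates the maximum that started at x = 0; the values of k and ω, the search window, and the function names are arbitrary choices made for the illustration.

```python
import math

k = 2.0       # wave number in rad/m (illustrative value)
omega = 6.0   # angular frequency in rad/s (illustrative value)

def psi(x, t, A=1.0):
    """The travelling wave A cos(kx - omega t)."""
    return A * math.cos(k * x - omega * t)

def crest_position(t, x_lo, x_hi, samples=20001):
    """Grid point in [x_lo, x_hi] where the wave is largest at time t."""
    xs = [x_lo + (x_hi - x_lo) * i / (samples - 1) for i in range(samples)]
    return max(xs, key=lambda x: psi(x, t))

t = 0.4
# The window 0 <= x <= 3 is shorter than one wavelength (2*pi/k, about 3.14),
# so it contains only the crest that started at x = 0.
x_max = crest_position(t, 0.0, 3.0)
print(x_max, omega * t / k)
```

The located crest agrees with (ω/k)t to within the grid spacing, so dividing its displacement by t recovers the phase velocity ω/k.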

So far, we have confined our attention to one dimension, but Equation (3.2) can
be generalized to three dimensions by using a three-dimensional position vector r
in place of the one-dimensional position x. In this case, we must also take k to be
a vector, k, called the wave vector, and Equation (3.2) becomes

$$ \Psi(\mathbf{r}, t) = A\cos(\mathbf{k}\cdot\mathbf{r} - \omega t) $$

From our previous argument, this represents a wave travelling in the k direction.
However, this is not the most general possible form for such a wave. A sine function
serves just as well as a cosine function to represent an oscillating wave, so the most
general form for a wave moving in the k direction is

$$ \Psi(\mathbf{r}, t) = A_1\cos(\mathbf{k}\cdot\mathbf{r} - \omega t) + A_2\sin(\mathbf{k}\cdot\mathbf{r} - \omega t) \tag{3.4} $$

where $A_1$ and $A_2$ are constants which determine both the amplitude and phase of
the wave. Although this is a perfectly acceptable way to represent a general wave,
it is somewhat awkward. As an alternative, we can write

$$ \Psi(\mathbf{r}, t) = Be^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)} \tag{3.5} $$

where B is now a complex number, and in classical mechanics, it is understood
that we take the real part of the right-hand side of Equation (3.5).






FIGURE 3.2 The equation $\Psi(x, t) = A\cos(kx - \omega t)$ represents a wave moving in the
+x direction with phase velocity $\omega/k$.

Equations (3.4) and (3.5) are completely equivalent. To see this, expand
Equation (3.5) using Equation (2.2) and write B = B_1 + iB_2. Then Equation (3.5)
becomes

$$\Psi(\mathbf{r}, t) = B_1\cos(\mathbf{k}\cdot\mathbf{r} - \omega t) - B_2\sin(\mathbf{k}\cdot\mathbf{r} - \omega t) + iB_2\cos(\mathbf{k}\cdot\mathbf{r} - \omega t) + iB_1\sin(\mathbf{k}\cdot\mathbf{r} - \omega t) \tag{3.6}$$

By choosing B_1 = A_1 and B_2 = −A_2 and taking the real part of the right-hand
side of Equation (3.6), we get the same result as Equation (3.4). We will now use
Equation (3.5) to represent our wave function.
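The equivalence of the real and complex forms can be spot-checked numerically. The following is a minimal sketch in which the amplitudes A_1 and A_2 are arbitrary illustrative numbers and θ stands for the phase k · r − ωt.

```python
import cmath
import math

# Arbitrary real amplitudes for the form of Equation (3.4) (illustrative values)
A1, A2 = 0.7, -1.3

# Complex amplitude for the form of Equation (3.5): B = B1 + i B2 with B1 = A1, B2 = -A2
B = complex(A1, -A2)


def real_form(theta):
    """A1 cos(theta) + A2 sin(theta), where theta stands for k.r - omega t."""
    return A1 * math.cos(theta) + A2 * math.sin(theta)


def complex_form(theta):
    """Real part of B exp(i theta)."""
    return (B * cmath.exp(1j * theta)).real


thetas = [0.1 * n for n in range(100)]
max_diff = max(abs(real_form(th) - complex_form(th)) for th in thetas)
print("largest difference between the two forms:", max_diff)

# The modulus of B is the overall amplitude of the combined wave
print("amplitude |B| =", abs(B), " sqrt(A1^2 + A2^2) =", math.hypot(A1, A2))
```

The single complex number B carries the same two pieces of information as the pair (A_1, A_2): its modulus is the amplitude and its argument fixes the phase.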

In order to derive a plausible equation for the wave function, we need to make
a connection between k, ω, and the momentum and energy of the particle. In
Chapter 1 we noted two relations between the properties of waves and physically
measurable quantities. Specifically, for light, the energy E and the frequency ν are
related by


$$E = h\nu$$


and for matter waves, the momentum p and the wavelength λ are related by

$$p = h/\lambda$$

We will assume that both of these properties apply to matter waves. The equation
for energy, E = hν, corresponds to

$$E = (h/2\pi)(2\pi\nu) = \hbar\omega$$

and the momentum equation, p = h/λ, becomes

$$p = (h/2\pi)(2\pi/\lambda) = \hbar k \tag{3.7}$$

We can generalize this equation from one dimension to three dimensions. In three
dimensions, k points in the direction of motion of the particle, which is also the
direction of the momentum vector, p. Hence, Equation (3.7) becomes

$$\mathbf{p} = \hbar\mathbf{k}$$


For a non-relativistic particle with mass m and with no potential energy, the
relationship between p and E is given by

$$E = \frac{p^2}{2m} \tag{3.8}$$


It is now possible to generate expressions for E and p from Equation (3.5)
by taking the appropriate derivatives. Taking the derivative of Equation (3.5) with
respect to time, we get

$$\frac{\partial\Psi}{\partial t} = -i\omega B e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)}$$

Multiplying both sides by iħ gives energy on the right-hand side:

$$i\hbar\frac{\partial\Psi}{\partial t} = \hbar\omega B e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)} = E\Psi \tag{3.9}$$

Note that iħ(∂/∂t) is a linear operator, and Ψ is an eigenfunction of this operator
with eigenvalue E.

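Now we need to find a similar operator which gives the momentum p. Expanding
the dot product in Equation (3.5) gives

$$\Psi(\mathbf{r}, t) = B e^{i(k_x x + k_y y + k_z z - \omega t)}$$

so that

$$\frac{\partial\Psi}{\partial x} = i k_x B e^{i(k_x x + k_y y + k_z z - \omega t)}$$

$$\frac{\partial\Psi}{\partial y} = i k_y B e^{i(k_x x + k_y y + k_z z - \omega t)}$$

$$\frac{\partial\Psi}{\partial z} = i k_z B e^{i(k_x x + k_y y + k_z z - \omega t)}$$

Then

$$\begin{aligned}
\nabla\Psi &= \hat{\mathbf{x}}\frac{\partial\Psi}{\partial x} + \hat{\mathbf{y}}\frac{\partial\Psi}{\partial y} + \hat{\mathbf{z}}\frac{\partial\Psi}{\partial z} \\
&= (i k_x\hat{\mathbf{x}} + i k_y\hat{\mathbf{y}} + i k_z\hat{\mathbf{z}})\, B e^{i(k_x x + k_y y + k_z z - \omega t)} \\
&= i\mathbf{k}\Psi
\end{aligned}$$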


Since we want p = ħk to multiply Ψ on the right-hand side, we multiply both
sides by −iħ:

$$-i\hbar\nabla\Psi = \hbar\mathbf{k}\Psi = \mathbf{p}\Psi$$

In summary, we now have two operators for which Ψ is an eigenfunction; one
gives the energy E as the eigenvalue, and the other gives the momentum p as the
eigenvalue:

$$i\hbar\frac{\partial\Psi}{\partial t} = E\Psi$$

$$-i\hbar\nabla\Psi = \mathbf{p}\Psi$$

We now derive an equation which corresponds to Equation (3.8). Applying the
momentum operator −iħ∇ twice to Ψ produces two factors of p:

$$\begin{aligned}
(-i\hbar\nabla)\cdot(-i\hbar\nabla)\Psi &= \mathbf{p}\cdot\mathbf{p}\,\Psi \\
&= p^2\Psi
\end{aligned}$$

Hence,

$$-\hbar^2\nabla^2\Psi = p^2\Psi$$

(Recall that ∇² is shorthand for ∇ · ∇ = ∂²/∂x² + ∂²/∂y² + ∂²/∂z².) Then

$$-\frac{\hbar^2}{2m}\nabla^2\Psi = \frac{p^2}{2m}\Psi \tag{3.10}$$


The right-hand sides of Equations (3.9) and (3.10) are manifestly equal, since
p²/2m = E. Then equating the left-hand sides of these equations gives us a
differential equation satisfied by Ψ:

$$-\frac{\hbar^2}{2m}\nabla^2\Psi = i\hbar\frac{\partial\Psi}{\partial t} \tag{3.11}$$
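The claim that the plane wave satisfies the free-particle equation only when E = p²/2m can be tested with finite differences. This is a one-dimensional sketch in units where ħ = m = 1; the units, the value of k, and the helper `residual` are assumptions made for illustration.

```python
import cmath

hbar, m = 1.0, 1.0                 # illustrative units (an assumption)
k = 1.5                            # arbitrary wave number
omega = hbar * k**2 / (2 * m)      # follows from E = p^2/2m with E = hbar*omega, p = hbar*k


def psi(x, t, w):
    """One-dimensional plane wave exp(i(kx - w t)) with B = 1."""
    return cmath.exp(1j * (k * x - w * t))


def residual(x, t, w, h=1e-3):
    """Left side minus right side of the free-particle equation, via central differences."""
    d2x = (psi(x + h, t, w) - 2 * psi(x, t, w) + psi(x - h, t, w)) / h**2
    dt = (psi(x, t + h, w) - psi(x, t - h, w)) / (2 * h)
    return -(hbar**2 / (2 * m)) * d2x - 1j * hbar * dt


good = abs(residual(0.3, 0.7, omega))        # consistent energy and momentum
bad = abs(residual(0.3, 0.7, 2 * omega))     # deliberately inconsistent frequency

print("residual with omega = hbar k^2 / 2m:", good)
print("residual with doubled omega        :", bad)
```

The first residual vanishes up to discretization error, while the second does not, which shows that the differential equation encodes the energy-momentum relation of Equation (3.8).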



So far, all we have shown is that the wave function given in Equation (3.5)
satisfies Equation (3.11), and further, that it is an eigenfunction of the operators on
the left-hand and right-hand sides of Equation (3.11) with eigenvalues p²/2m and
E, respectively. It is at this point that we make an unjustified leap: what happens
if we now add a potential energy V to the system? In a classical system, the total
energy is just the sum of the kinetic and potential energies:

$$E = \frac{p^2}{2m} + V$$

This suggests that we modify Equation (3.11) to read

$$-\frac{\hbar^2}{2m}\nabla^2\Psi + V\Psi = i\hbar\frac{\partial\Psi}{\partial t} \tag{3.12}$$

(Note that unlike the kinetic energy and total energy, which correspond to operators
containing various derivatives, the operator corresponding to the potential energy
is simply multiplication of Ψ by V.)

Equation (3.12) is the Schrodinger equation, possibly the most important
equation in all of 20th-century physics. Note that we have not derived this equation
in a mathematical sense. We have constructed Equation (3.12) so that for V = 0,
the equation will be satisfied by the wave function Ψ = Be^{i(k·r − ωt)}. However, this
particular wave function will not satisfy Equation (3.12) if V ≠ 0. We simply
postulate that Equation (3.12) will give the “correct” wave function for any potential
V. We will see that the predictions of the Schrodinger equation do agree with a
wide variety of physical phenomena.

In its most general form, Ψ is a complex function of three spatial coordinates
x, y, and z, and of the time t. The potential V, in general, is also a function of x, y,
z, and t. Hence, in the most general case, Equation (3.12) is really shorthand for

$$-\frac{\hbar^2}{2m}\left(\frac{\partial^2\Psi(x, y, z, t)}{\partial x^2} + \frac{\partial^2\Psi(x, y, z, t)}{\partial y^2} + \frac{\partial^2\Psi(x, y, z, t)}{\partial z^2}\right) + V(x, y, z, t)\,\Psi(x, y, z, t) = i\hbar\frac{\partial\Psi(x, y, z, t)}{\partial t} \tag{3.13}$$


Most practical applications of the Schrodinger equation involve a simplification 
of the most general case, e.g., motion in only one dimension, potentials which 
are independent of time, etc. For these cases, the Schrodinger equation assumes 
a much simpler form than Equation (3.13). For example, for a particle moving in 
one dimension, the Schrodinger equation becomes

$$-\frac{\hbar^2}{2m}\frac{\partial^2\Psi(x, t)}{\partial x^2} + V(x, t)\,\Psi(x, t) = i\hbar\frac{\partial\Psi(x, t)}{\partial t}$$

Example 3.1. A Solution of the One-Dimensional Schrodinger Equation.
Consider the one-dimensional infinite square-well potential of width a, shown in
Figure 3.3. The potential V(x) is given by

$$V(x) = \begin{cases} 0, & \text{for } 0 \le x \le a \\ \infty, & \text{for } x < 0 \text{ or } x > a \end{cases}$$

Of course, no physical potential can be truly infinite, but this potential will be a
good approximation for any system with sharp potential barriers such that V ≫ E.
The infinite potential barriers force Ψ(x, t) to be zero outside of the potential
well, and give the boundary conditions Ψ(0, t) = 0 and Ψ(a, t) = 0. (This will be
discussed in more detail in Chapter 4.) Note further that the potential in this case
is independent of time.

For 0 < x < a, the potential is zero, so the Schrodinger equation can be written
as

$$-\frac{\hbar^2}{2m}\frac{\partial^2\Psi(x, t)}{\partial x^2} = i\hbar\frac{\partial\Psi(x, t)}{\partial t} \tag{3.14}$$

To solve this equation, we assume that the solution has the form

$$\Psi(x, t) = \psi(x)\chi(t) \tag{3.15}$$

where ψ is a function only of x and is independent of t, while χ is a function only
of t and is independent of x. How do we know that the solution of Equation (3.14)
can be written in the form of Equation (3.15)? In fact, we don’t. The only way
to determine if the solution has this form is to see if we can find functions ψ(x)
and χ(t) for which Ψ(x, t) = ψ(x)χ(t) satisfies Equation (3.14). However, many
partial differential equations in physics do, in fact, yield solutions of the form
given in Equation (3.15). (Of course, we wouldn’t be introducing this solution if
it didn’t work in this case!) This general method of solution is called separation
of variables.

FIGURE 3.3 The infinite square-well potential

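Substituting Equation (3.15) into Equation (3.14) gives

$$-\frac{\hbar^2}{2m}\frac{\partial^2\psi(x)}{\partial x^2}\chi(t) = i\hbar\,\psi(x)\frac{\partial\chi(t)}{\partial t}$$

and dividing both sides by (−ħ²/2m)ψ(x)χ(t) yields

$$\frac{1}{\psi(x)}\frac{\partial^2\psi(x)}{\partial x^2} = -\frac{2mi}{\hbar}\frac{1}{\chi(t)}\frac{\partial\chi(t)}{\partial t} \tag{3.16}$$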

Note that the left-hand side of Equation (3.16) is a function only of x and is
independent of t, while the right-hand side is a function only of t and is independent
of x. This apparent contradiction can be resolved by noting that only one kind of
function satisfies both of these requirements: a constant, which is independent
of both x and t. Setting both sides of Equation (3.16) equal to the constant C, we
get

$$\frac{d^2\psi(x)}{dx^2} = C\psi(x) \tag{3.17}$$

and

$$-\frac{2mi}{\hbar}\frac{d\chi(t)}{dt} = C\chi(t) \tag{3.18}$$

where the partial derivatives have become total derivatives, since each equation
now contains only a single independent variable.

Consider first the equation for ψ(x). The general solution to Equation (3.17)
is either a sum of two exponentials with real arguments (for C > 0) or a sum
of two trigonometric functions with real arguments (for C < 0). However, the
boundary conditions give an additional constraint on the solution. The infinite
barriers produce Ψ(0, t) = 0 and Ψ(a, t) = 0, which means that ψ(0) = ψ(a) =
0. A sum of two exponentials with real arguments has at most one value
of x for which ψ(x) = 0. Hence, ψ(x) must be a sum of trigonometric functions,
namely

$$\psi(x) = A_1\sin(\sqrt{-C}\,x) + A_2\cos(\sqrt{-C}\,x) \tag{3.19}$$

where A_1 and A_2 are constants, and C must be negative, so −C is positive. The
condition that ψ(0) = 0 means that A_2 = 0 in Equation (3.19), giving

$$\psi(x) = A_1\sin(\sqrt{-C}\,x)$$

while the condition ψ(a) = 0 can be satisfied for

$$\sqrt{-C}\,a = n\pi, \quad n = 1, 2, 3, \ldots \tag{3.20}$$



We will examine the general solution in more detail in Chapter 4; here we will
simply make use of a single solution, n = 1, which gives

$$C = -\pi^2/a^2$$

and

$$\psi(x) = A_1\sin\left(\frac{\pi x}{a}\right)$$

Now we can solve for χ(t) using Equation (3.18). Taking C = −π²/a² in this
equation gives

$$\frac{d\chi}{dt} = -\frac{i\hbar\pi^2}{2ma^2}\chi$$

which has the solution

$$\chi = A_2 e^{-i\hbar\pi^2 t/2ma^2}$$

Combining the solutions for ψ(x) and χ(t) into an expression for Ψ(x, t) gives

$$\Psi(x, t) = \begin{cases} A\sin\left(\dfrac{\pi x}{a}\right) e^{-i\hbar\pi^2 t/2ma^2}, & \text{for } 0 \le x \le a \\ 0, & \text{for } x < 0 \text{ or } x > a \end{cases}$$

In this solution, A = A_1A_2 remains an arbitrary constant. However, the value of A
will be determined in the next section, when we examine the physical interpretation
of the wave function.

Note that we have found only a single solution corresponding to n = 1 in
Equation (3.20). The general solution, as well as the physical significance of the
values for n, will be examined in the next chapter.
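The solution found in Example 3.1 can be checked by substituting it back into Equation (3.14) numerically. The sketch below assumes units with ħ = m = 1, a well width a = 2, and A = 1; all of these values, and the helper `residual`, are illustrative choices rather than part of the example.

```python
import cmath
import math

hbar, m, a, A = 1.0, 1.0, 2.0, 1.0   # illustrative units, width, and amplitude (assumptions)


def Psi(x, t):
    """n = 1 square-well solution from Example 3.1; zero outside the well."""
    if x < 0 or x > a:
        return 0j
    phase = cmath.exp(-1j * hbar * math.pi**2 * t / (2 * m * a**2))
    return A * math.sin(math.pi * x / a) * phase


def residual(x, t, h=1e-3):
    """Left side minus right side of Equation (3.14), using central differences."""
    d2x = (Psi(x + h, t) - 2 * Psi(x, t) + Psi(x - h, t)) / h**2
    dt = (Psi(x, t + h) - Psi(x, t - h)) / (2 * h)
    return -(hbar**2 / (2 * m)) * d2x - 1j * hbar * dt


t = 0.6
interior_points = [0.25 * a, 0.5 * a, 0.9 * a]
worst_residual = max(abs(residual(x, t)) for x in interior_points)
edge_values = (abs(Psi(0.0, t)), abs(Psi(a, t)))

print("largest residual inside the well:", worst_residual)
print("|Psi| at x = 0 and x = a        :", edge_values)
```

Both the differential equation and the boundary conditions are satisfied to within numerical error, for any choice of the constant A.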


3.2 ■ THE MEANING OF THE WAVE FUNCTION

In the previous section, we derived the Schrodinger equation, which describes the
behavior of the wave function Ψ associated with a particle moving in a potential V.
This leaves an obvious question: what does Ψ tell us about the physical behavior
of the particle?

The interpretation of Ψ was provided by Max Born. Recall that Ψ(r, t) is
complex, so it cannot by itself represent a physically measurable quantity. Born
argued that the square of the absolute value of the wave function, |Ψ(r, t)|² =
Ψ*(r, t)Ψ(r, t), which is always a real number, gives the probability per unit
volume of finding the particle at the position r at the time t. Hence Ψ*Ψ is called
the probability density. Since probability is a pure number, the probability density
must have units of 1/volume in three dimensions and 1/length in one dimension.
Note that this represents the abandonment of one of the basic ideas of classical
mechanics: a particle no longer has a definite position in space that can be described
as a known function of time. Instead, the wave function is used to calculate the
probability of finding the particle in a given region of space.

As an example, consider the wave function derived in Example 3.1 for a particle
in an infinite square well:

$$\Psi(x, t) = \begin{cases} A\sin\left(\dfrac{\pi x}{a}\right) e^{-i\hbar\pi^2 t/2ma^2}, & \text{for } 0 \le x \le a \\ 0, & \text{for } x < 0 \text{ or } x > a \end{cases} \tag{3.21}$$

This solution satisfies the Schrodinger equation for an arbitrary value of A.
This is a result of the fact that if Ψ is any solution of the Schrodinger equation,
then cΨ is also a solution for any complex number c:

$$\begin{aligned}
-\frac{\hbar^2}{2m}\nabla^2\Psi + V\Psi &= i\hbar\frac{\partial\Psi}{\partial t} \\
-c\frac{\hbar^2}{2m}\nabla^2\Psi + cV\Psi &= c\,i\hbar\frac{\partial\Psi}{\partial t} \\
-\frac{\hbar^2}{2m}\nabla^2(c\Psi) + V(c\Psi) &= i\hbar\frac{\partial(c\Psi)}{\partial t}
\end{aligned}$$

Hence, the value of A cannot be determined from the Schrodinger equation alone.
But it can be determined from the definition of the probability density.

According to the Born prescription, the probability density is

$$\begin{aligned}
\Psi^*\Psi &= \left(A^*\sin\left(\frac{\pi x}{a}\right) e^{+i\hbar\pi^2 t/2ma^2}\right)\left(A\sin\left(\frac{\pi x}{a}\right) e^{-i\hbar\pi^2 t/2ma^2}\right) \\
&= |A|^2\sin^2\left(\frac{\pi x}{a}\right)
\end{aligned} \tag{3.22}$$

for 0 ≤ x ≤ a, and Ψ*Ψ = 0 for x outside of this range. (Note also that the time t
has dropped out of the expression for the probability density; for this wave function
the probability density is independent of time.) Now we require that the probability
density satisfy an additional requirement, namely

$$\int\Psi^*(\mathbf{r}, t)\,\Psi(\mathbf{r}, t)\,d^3r = 1 \tag{3.23}$$

where the integral is taken over all of space. The justification for Equation (3.23)
follows from the definition of the probability density; if Ψ*(r, t)Ψ(r, t) gives the
probability per unit volume of finding the particle at position r, then the integral
of this quantity over all of space gives the probability of finding it somewhere,
which must be 1. A wave function which satisfies Equation (3.23) is said to be
normalized.

FIGURE 3.4 The normalized wave function given by Equation (3.25) for a particle in an
infinite square-well potential at time t = 0.

For the infinite square-well wave function given by Equation (3.21), we can
substitute Ψ*Ψ from Equation (3.22) into Equation (3.23) to find the value of the
constant A:

$$\int_0^a |A|^2\sin^2\left(\frac{\pi x}{a}\right) dx = |A|^2\,\frac{a}{2} = 1$$

so that

$$|A|^2 = \frac{2}{a} \tag{3.24}$$

If we take A to be a positive real number, then Equation (3.24) has a unique
solution: A = √(2/a), and our normalized wave function is then

$$\Psi(x, t) = \begin{cases} \sqrt{\dfrac{2}{a}}\,\sin\left(\dfrac{\pi x}{a}\right) e^{-i\hbar\pi^2 t/2ma^2}, & \text{for } 0 \le x \le a \\ 0, & \text{for } x < 0 \text{ or } x > a \end{cases} \tag{3.25}$$

This wave function is shown in Figure 3.4 at t = 0.

Note, however, that Equation (3.24) has infinitely many other solutions. We
could just as easily have taken A = −√(2/a). More generally, it is easy to show (see
Exercise 3.7) that A can be any complex number of the form A = e^{iθ}√(2/a), where
θ is an arbitrary (real) constant. This leads to an obvious question: which value of A
do we choose? The answer is that it doesn’t matter. We will see that all observable
quantities depend only on |A|², not on A itself. Hence, any value of A which
satisfies Equation (3.24) will give the same predictions for any measurements that
we make. In practice, it is conventional to take (as we have) A to be real and
positive.
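The freedom in the phase of A can be illustrated by integrating the probability density for several candidate values. In this sketch the well width a = 3, the phase angle 0.9, and the helper `simpson` are arbitrary assumptions made only for the demonstration.

```python
import cmath
import math


def simpson(f, lo, hi, n=1000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3


a = 3.0   # illustrative well width (assumption)


def total_probability(A):
    """Integral of |A sin(pi x / a)|^2 over the well, the left side of Equation (3.23)."""
    return simpson(lambda x: abs(A * math.sin(math.pi * x / a)) ** 2, 0.0, a)


magnitude = math.sqrt(2 / a)
candidates = [
    magnitude,                          # real and positive (the conventional choice)
    -magnitude,                         # real and negative
    cmath.exp(1j * 0.9) * magnitude,    # arbitrary phase angle theta = 0.9
]
totals = [total_probability(A) for A in candidates]

for A, total in zip(candidates, totals):
    print(A, "->", total)
```

All three choices integrate to 1, which is consistent with the statement that only |A|² is fixed by normalization.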




The Born interpretation tells us that the probability per unit length of finding the
particle at a point x is given by Ψ*(x, t)Ψ(x, t). Hence, the probability of finding
the particle in some region is simply the integral of Ψ*Ψ over that region.

Example 3.2. The Probability of Finding a Particle at a Particular Location.

Consider the particle described by the wave function of Equation (3.25). What is
the probability P(x < ε) that the particle lies within a distance ε of the left-hand
wall of the square well with ε ≪ a?

The desired probability is given by

$$P(x < \epsilon) = \int_{x=0}^{\epsilon}\frac{2}{a}\sin^2\left(\frac{\pi x}{a}\right) dx = \frac{\epsilon}{a} - \frac{1}{2\pi}\sin\left(\frac{2\pi\epsilon}{a}\right) \tag{3.26}$$

In the limit where ε ≪ a, the second term in Equation (3.26) can be expanded as
sin(x) ≈ x − x³/6 + ···, giving

$$P(x < \epsilon) \approx \frac{2\pi^2}{3}\left(\frac{\epsilon}{a}\right)^3$$
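To see how good the small-ε expansion is, the closed form of Equation (3.26) can be compared with a direct numerical integral and with the leading cubic term. The width a = 1, the test value ε = 0.01, and the function names below are illustrative assumptions.

```python
import math

a = 1.0   # illustrative well width (assumption)


def simpson(f, lo, hi, n=1000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3


def prob_exact(eps):
    """Closed form of Equation (3.26)."""
    return eps / a - math.sin(2 * math.pi * eps / a) / (2 * math.pi)


def prob_numeric(eps):
    """Direct integration of the probability density (2/a) sin^2(pi x / a) from 0 to eps."""
    return simpson(lambda x: (2 / a) * math.sin(math.pi * x / a) ** 2, 0.0, eps)


def prob_small_eps(eps):
    """Leading term of the expansion for eps much smaller than a."""
    return (2 * math.pi**2 / 3) * (eps / a) ** 3


eps = 0.01 * a
relative_error = abs(prob_small_eps(eps) - prob_exact(eps)) / prob_exact(eps)

print("closed form     :", prob_exact(eps))
print("numeric integral:", prob_numeric(eps))
print("cubic estimate  :", prob_small_eps(eps))
print("relative error of cubic estimate:", relative_error)
```

The probability scales as the cube of ε because the probability density itself vanishes quadratically at the wall.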



Although Ψ*Ψ can be used to derive the probability of finding the particle at
a particular location, it contains more information than this. We can treat Ψ*Ψ as
the probability distribution function for the position of the particle. To see how this
works, consider first a much more practical example: the calculation of a grade
point average. If f_A is the fraction of a student’s total grades which are A’s, f_B
is the fraction which are B’s, and so on, then the student’s grade point average is
simply

$$\text{grade point average} = f_A\cdot 4 + f_B\cdot 3 + f_C\cdot 2 + f_D\cdot 1 + f_F\cdot 0$$

Similarly, for any discrete distribution p(j), the average value of j (denoted ⟨j⟩)
is just

$$\langle j\rangle = \sum_j p(j)\,j \tag{3.27}$$

If the quantity of interest is continuous rather than discrete, then Equation (3.27)
changes from a sum to an integral, giving

$$\langle x\rangle = \int p(x)\,x\,dx$$

where the integral is taken over all allowed values of x.




Now consider a large number of identical particles all described by the wave
function Ψ, and suppose we measure the position of every one of these particles.
Since Ψ*Ψ is the distribution function for each of the individual particle positions,
it follows that the average position of the particles, ⟨x⟩, will be given by

$$\langle x\rangle = \int_{-\infty}^{\infty}\Psi^*(x, t)\,x\,\Psi(x, t)\,dx \tag{3.28}$$

(Of course, Equation (3.28) gives the theoretical average position; the actual
measured average will tend toward this value as the number of measurements goes to
infinity.) Note that we have written the right-hand side in a peculiar way, inserting
x between Ψ* and Ψ. The reason for this will become apparent shortly.


Example 3.3. The Average Position of a Particle.

Consider the wave function for the infinite square well given in Equation (3.25).
The mean value for the position of the particle is

$$\begin{aligned}
\langle x\rangle &= \int_{-\infty}^{\infty}\Psi^*(x, t)\,x\,\Psi(x, t)\,dx \\
&= \int_0^a\frac{2}{a}\sin^2\left(\frac{\pi x}{a}\right) x\,dx \\
&= \frac{a}{2}
\end{aligned}$$

This result, that ⟨x⟩ = a/2, is exactly what we would expect, since the wave
function is symmetric and is centered at a/2.
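The idea that a measured average approaches ⟨x⟩ as the number of measurements grows can be imitated with simulated data. The sketch below draws positions from the probability density of Equation (3.25) by rejection sampling; the well width, the seed, and the helper names are assumptions for illustration, and the random draws stand in for real measurements.

```python
import math
import random

a = 2.0   # illustrative well width (assumption)


def density(x):
    """Probability density (2/a) sin^2(pi x / a) inside the well."""
    return (2.0 / a) * math.sin(math.pi * x / a) ** 2


def measure_position(rng):
    """Simulate one position measurement by rejection sampling."""
    while True:
        x = rng.uniform(0.0, a)
        # Accept x with probability density(x) / (largest density), where the largest value is 2/a
        if rng.random() < density(x) / (2.0 / a):
            return x


def average_position(n, seed):
    """Average of n simulated measurements, using a fixed seed for repeatability."""
    rng = random.Random(seed)
    return sum(measure_position(rng) for _ in range(n)) / n


for n in (10, 1000, 20000):
    print(n, "measurements: average x =", average_position(n, seed=0))

mean_large = average_position(20000, seed=0)
print("theoretical value a/2 =", a / 2)
```

Small samples scatter noticeably around a/2, while the larger sample settles close to the theoretical value.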


What about other observable quantities? Consider, for instance, the momentum.
First consider the special case where Ψ is an eigenfunction of the momentum
operator, so that

$$-i\hbar\nabla\Psi = \mathbf{p}\Psi$$

or, in the one-dimensional case,

$$-i\hbar\frac{\partial\Psi}{\partial x} = p\Psi$$

For this special case, we have

$$\int\Psi^*(x, t)\left(-i\hbar\frac{\partial}{\partial x}\right)\Psi(x, t)\,dx = \int\Psi^*(x, t)\,p\,\Psi(x, t)\,dx = p \tag{3.29}$$

assuming that Ψ is normalized. In this case, the integral in Equation (3.29) gives
the momentum p. Now we postulate that even if Ψ is not an eigenfunction of the
momentum operator, the average value of the momentum will be given by

$$\langle p\rangle = \int\Psi^*(x, t)\left(-i\hbar\frac{\partial}{\partial x}\right)\Psi(x, t)\,dx$$


This brings us to one of the central ideas in quantum mechanics: every quantity
we can measure, such as position, momentum, energy, angular momentum, etc., is
associated with a corresponding linear operator. Measurable quantities are called,
in quantum mechanics, observables, and we have already noted the correspondence
between several observables and their operators:

OBSERVABLE ↔ OPERATOR

position        r ↔ r

momentum        p ↔ −iħ∇

energy          E ↔ iħ ∂/∂t


When a wave function Ψ is an eigenfunction of an operator, then it represents
a state in which the particle has a definite, fixed value for that observable. For
example, if

$$\Psi = e^{-iky - i\omega t} \tag{3.30}$$

then we have

$$\begin{aligned}
-i\hbar\nabla\Psi &= -i\hbar\,\hat{\mathbf{y}}\frac{\partial\Psi}{\partial y} \\
&= -\hbar k\,\hat{\mathbf{y}}\,\Psi
\end{aligned}$$

Thus, the wave function given by Equation (3.30) represents a particle in a state of
definite momentum: it has momentum ħk in the −y direction. If the momentum of
this particle is measured, the result will be a single, definite answer: the momentum
will be −ħkŷ.

In classical physics, of course, all measurements behave this way. A particle is
always assumed, for instance, to have a definite momentum. While experimental
error may limit the precision with which the momentum of a particle can be
measured, it is always taken to be a well-defined, fixed quantity. This assumption is not
true in quantum mechanics. Consider, for example, the square well wave function
in Equation (3.25). If we apply the one-dimensional momentum operator to this
wave function, we get

$$\begin{aligned}
-i\hbar\frac{\partial\Psi}{\partial x} &= -i\hbar\frac{\partial}{\partial x}\left[\sqrt{\frac{2}{a}}\sin\left(\frac{\pi x}{a}\right) e^{-i\hbar\pi^2 t/2ma^2}\right] \\
&= -\frac{i\hbar\pi}{a}\sqrt{\frac{2}{a}}\cos\left(\frac{\pi x}{a}\right) e^{-i\hbar\pi^2 t/2ma^2}
\end{aligned}$$

which is manifestly not equal to a constant times Ψ. Thus, the square-well wave
function is not an eigenfunction of the momentum operator, and it represents a
particle which is not in a state of definite momentum. Of course, if the momentum
of the particle is measured, a definite value is always obtained, but there is no
way to predict ahead of time what this value will be. This is one of the places
where quantum mechanics differs radically from classical physics: particles can
be in states in which the momentum, position, or energy of the particle is not
well-defined.
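One way to make the eigenfunction test concrete is to compute the ratio (−iħ ∂Ψ/∂x)/Ψ at several points: for an eigenfunction the ratio is the same everywhere. The sketch below does this at a fixed time (the common time factor cancels in the ratio), with ħ = 1, k = 2, and a = 1 chosen arbitrarily; `momentum_ratio` is a hypothetical helper.

```python
import cmath
import math

hbar, k, a = 1.0, 2.0, 1.0   # illustrative values (assumptions)


def momentum_ratio(f, x, h=1e-5):
    """(-i hbar df/dx) / f at the point x, using a central difference."""
    df = (f(x + h) - f(x - h)) / (2 * h)
    return -1j * hbar * df / f(x)


def plane_wave(y):
    """Spatial part of Equation (3.30) along the y-axis."""
    return cmath.exp(-1j * k * y)


def square_well(x):
    """Spatial part of Equation (3.25) inside the well."""
    return math.sqrt(2 / a) * math.sin(math.pi * x / a)


points = [0.2, 0.4, 0.7]
plane_ratios = [momentum_ratio(plane_wave, y) for y in points]
well_ratios = [momentum_ratio(square_well, x) for x in points]

print("plane wave ratios :", plane_ratios)   # all close to -hbar * k
print("square well ratios:", well_ratios)    # different at each point
```

The plane wave returns the same value, −ħk, at every point, whereas the square-well function gives a position-dependent ratio and therefore has no single momentum eigenvalue.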

However, what is well-defined is the mean value of any observable, called the
expectation value. The expectation value can be calculated for any observable,
using the corresponding operator. If the operator O corresponds to an observable
o, then the expectation value of o is just

$$\langle o\rangle = \int\Psi^*(\mathbf{r}, t)\,O\,\Psi(\mathbf{r}, t)\,d^3r$$

For example, for a one-dimensional wave function Ψ(x, t), we get

$$\langle x\rangle = \int_{-\infty}^{\infty}\Psi^*(x, t)\,x\,\Psi(x, t)\,dx$$

$$\langle p\rangle = \int_{-\infty}^{\infty}\Psi^*(x, t)\left(-i\hbar\frac{\partial}{\partial x}\right)\Psi(x, t)\,dx$$

(IMPORTANT: Note that the wave function must be normalized before the
expectation value is computed!)


Example 3.4. Expectation Value of the Momentum.

Consider the expectation value of the momentum for a particle with the wave
function given by Equation (3.25):

$$\begin{aligned}
\langle p\rangle &= \int_0^a\Psi^*(x, t)\left(-i\hbar\frac{\partial}{\partial x}\right)\Psi(x, t)\,dx \\
&= -\frac{2i\hbar\pi}{a^2}\int_0^a\sin\left(\frac{\pi x}{a}\right)\cos\left(\frac{\pi x}{a}\right) dx \\
&= 0
\end{aligned}$$

Thus, the expectation value of the momentum is zero. This means that if p is
measured for a set of particles, all described by Equation (3.25), there is no way
to predict in advance the outcome of an individual measurement, but the mean
value of p, calculated by averaging over many measurements, will tend toward
zero. This result for the expectation value makes sense, since the corresponding
classical system is a particle bouncing back and forth inside the potential well; its
momentum averaged over many back-and-forth trips is zero.


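Using the same technique, the expectation value of functions of an observable
can also be calculated. For a particle moving in one dimension, for instance, we
have

$$\langle x^2\rangle = \int_{-\infty}^{\infty}\Psi^*\,x^2\,\Psi\,dx$$

$$\langle p^2\rangle = \int_{-\infty}^{\infty}\Psi^*\left(-\hbar^2\frac{\partial^2}{\partial x^2}\right)\Psi\,dx$$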
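These expectation values can be evaluated numerically for the square-well wave function of Equation (3.25). The sketch below assumes ħ = m = 1 and a = 2, uses analytically computed derivatives, and keeps the time-dependent phase in place to show that it drops out; the numerical values and helper names are illustrative.

```python
import cmath
import math

hbar, m, a = 1.0, 1.0, 2.0   # illustrative units and well width (assumptions)
t = 0.37                     # arbitrary time; the results should not depend on it

phase = cmath.exp(-1j * hbar * math.pi**2 * t / (2 * m * a**2))
norm = math.sqrt(2 / a)


def Psi(x):
    """Normalized square-well wave function, Equation (3.25), at time t."""
    return norm * math.sin(math.pi * x / a) * phase


def dPsi(x):
    """First derivative of Psi with respect to x (computed analytically)."""
    return norm * (math.pi / a) * math.cos(math.pi * x / a) * phase


def d2Psi(x):
    """Second derivative of Psi with respect to x."""
    return -((math.pi / a) ** 2) * Psi(x)


def simpson(f, lo, hi, n=2000):
    """Composite Simpson rule; works for complex-valued integrands."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3


p_mean = simpson(lambda x: Psi(x).conjugate() * (-1j * hbar) * dPsi(x), 0.0, a)
p2_mean = simpson(lambda x: Psi(x).conjugate() * (-hbar**2) * d2Psi(x), 0.0, a)
x2_mean = simpson(lambda x: Psi(x).conjugate() * x**2 * Psi(x), 0.0, a)

print("<p>   =", p_mean)
print("<p^2> =", p2_mean.real, " (hbar*pi/a)^2 =", (hbar * math.pi / a) ** 2)
print("<x^2> =", x2_mean.real)
```

Although ⟨p⟩ vanishes, ⟨p²⟩ does not, which fits the statement that individual momentum measurements return nonzero values that average to zero.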


3.3 ■ THE TIME-INDEPENDENT SCHRODINGER EQUATION: QUALITATIVE
SOLUTIONS AND THE ORIGIN OF QUANTIZATION

Derivation of the Time-Independent Schrodinger Equation 

In this section, we will consider a special case of the Schrodinger equation, but
this special case will occupy much of the remainder of this book. For the most
general version of the Schrodinger equation,

$$-\frac{\hbar^2}{2m}\nabla^2\Psi + V(\mathbf{r}, t)\Psi = i\hbar\frac{\partial\Psi}{\partial t}$$

the various cases of interest are all embodied in the choice of the potential V(r, t).
Now consider a special class of potentials: those for which V is a function only
of position r, and is independent of time t. This is a very commonly encountered
case; for example, the potential that an electron feels in an atom or in a crystal
lattice can be treated this way. For this case, the Schrodinger equation becomes

$$-\frac{\hbar^2}{2m}\nabla^2\Psi + V(\mathbf{r})\Psi = i\hbar\frac{\partial\Psi}{\partial t} \tag{3.31}$$

It is possible to make a further simplification by considering only a particular class
of solutions. We will require that the wave function Ψ be an eigenfunction of the
energy operator, iħ(∂/∂t), so that this wave function corresponds to a state of
definite energy E. This means that

$$i\hbar\frac{\partial\Psi(\mathbf{r}, t)}{\partial t} = E\Psi(\mathbf{r}, t) \tag{3.32}$$



Substituting this expression for the time derivative into Equation (3.31), we obtain

$$-\frac{\hbar^2}{2m}\nabla^2\Psi(\mathbf{r}, t) + V(\mathbf{r})\Psi(\mathbf{r}, t) = E\Psi(\mathbf{r}, t) \tag{3.33}$$

Equations (3.32) and (3.33) can be solved using separation of variables. Assume
a solution of the form

$$\Psi(\mathbf{r}, t) = \psi(\mathbf{r})\chi(t) \tag{3.34}$$

where ψ is a function only of position, and χ is a function only of time. Substituting
this expression into Equation (3.32) gives

$$i\hbar\,\psi(\mathbf{r})\frac{d\chi(t)}{dt} = E\psi(\mathbf{r})\chi(t)$$

The factors of ψ(r) cancel, and the resulting equation for χ(t) reduces to

$$\frac{d\chi(t)}{dt} = -\frac{iE}{\hbar}\chi(t)$$

which has the solution

$$\chi(t) = e^{-iEt/\hbar} \tag{3.35}$$

Now we substitute the expression for Ψ(r, t) from Equation (3.34) into
Equation (3.33) to yield

$$-\frac{\hbar^2}{2m}\nabla^2\psi(\mathbf{r})\chi(t) + V(\mathbf{r})\psi(\mathbf{r})\chi(t) = E\psi(\mathbf{r})\chi(t)$$

Here the factor χ(t) cancels to give

$$-\frac{\hbar^2}{2m}\nabla^2\psi(\mathbf{r}) + V(\mathbf{r})\psi(\mathbf{r}) = E\psi(\mathbf{r}) \tag{3.36}$$

Equation (3.36) is an extremely important version of the Schrodinger equation,
called the time-independent Schrodinger equation. It can be used whenever the
potential V is independent of time to find wave functions that are states of definite
energy.

Once a solution ψ(r) to the time-independent Schrodinger equation is derived,
it can be used to recover the full wave function using the time-dependent piece of
the solution in Equation (3.35):

$$\Psi(\mathbf{r}, t) = \psi(\mathbf{r})\,e^{-iEt/\hbar}$$

Often, however, the quantity of interest will be the time-independent wave function,
ψ(r), because many physical quantities can be derived from ψ(r) alone, without
reference to the full wave function Ψ(r, t). For instance, the probability density
for the particle, Ψ*(r, t)Ψ(r, t), is just

$$\begin{aligned}
\Psi^*(\mathbf{r}, t)\Psi(\mathbf{r}, t) &= \psi(\mathbf{r})^* e^{iEt/\hbar}\,\psi(\mathbf{r})\,e^{-iEt/\hbar} \\
&= \psi^*(\mathbf{r})\psi(\mathbf{r})
\end{aligned}$$

We can also write expectation values in terms of ψ(r). The expectation value of
any observable o which corresponds to an operator O that does not depend on time
is

$$\begin{aligned}
\langle o\rangle &= \int\psi(\mathbf{r})^* e^{iEt/\hbar}\,O\,\psi(\mathbf{r})\,e^{-iEt/\hbar}\,d^3r \\
&= \int\psi(\mathbf{r})^*\,O\,\psi(\mathbf{r})\,d^3r
\end{aligned}$$

Clearly, both the probability density and the expectation values are independent
of time. Of course, these arguments apply only to states of definite energy, which
satisfy the time-independent Schrodinger equation. For this reason, such states are
also called stationary states. Of course, the wave function itself still has a time
dependence, given by the factor e^{−iEt/ħ}, but when the observable o is actually
measured, the time-dependent factors cancel out in the calculation of the expectation
value: e^{iEt/ħ} e^{−iEt/ħ} = 1.
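The following sketch illustrates the stationary-state property for the square-well state of Equation (3.25). It assumes ħ = m = a = 1 and takes E = ħ²π²/2ma², the value obtained by comparing the time factor in Equation (3.25) with Equation (3.35); the sample points and times are arbitrary.

```python
import cmath
import math

hbar, m, a = 1.0, 1.0, 1.0                      # illustrative units (assumptions)
E = hbar**2 * math.pi**2 / (2 * m * a**2)       # energy read off from the time factor


def psi(x):
    """Time-independent wave function inside the well."""
    return math.sqrt(2 / a) * math.sin(math.pi * x / a)


def Psi(x, t):
    """Full wave function: psi(x) times the phase factor exp(-i E t / hbar)."""
    return psi(x) * cmath.exp(-1j * E * t / hbar)


positions = [0.1 * a, 0.5 * a, 0.8 * a]
times = [0.0, 0.3, 1.7, 10.0]

# Largest change in the probability density over all sampled positions and times
density_spread = max(
    abs(abs(Psi(x, t)) ** 2 - psi(x) ** 2) for x in positions for t in times
)

# The wave function itself does change with time
wave_function_change = abs(Psi(0.5 * a, 0.3) - Psi(0.5 * a, 0.0))

print("largest change in |Psi|^2:", density_spread)
print("change in Psi itself     :", wave_function_change)
```

The wave function rotates in the complex plane as time passes, but its squared modulus stays fixed, which is what the term stationary refers to.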

The time-independent Schrodinger equation suggests the definition of a new
operator, H, given by

$$H = -\frac{\hbar^2}{2m}\nabla^2 + V \tag{3.37}$$

Then the time-independent Schrodinger equation can be written in the compact
form

$$H\psi = E\psi$$

so that E is the eigenvalue of H. The operator H defined by Equation (3.37) is called
the Hamiltonian operator, or just the Hamiltonian. (The Hamiltonian is named after
William Hamilton, an Irish mathematician who died in 1865, long before the birth
of quantum mechanics. The name originated in classical mechanics, where H
corresponds to the total energy of a particle.)

Qualitative Solutions and the Origin of Quantization 

In attempting to solve the time-independent Schrodinger equation, the first point
to note (see Exercise 3.8) is that if ψ_1(r) and ψ_2(r) are two different solutions
of the time-independent Schrodinger equation with the same value of E, then
ψ_1(r) + ψ_2(r) is also a solution with energy E. Furthermore, if ψ(r) is a solution
with energy E, then cψ(r) (where c is any complex number) is also a solution
with energy E. Thus, any solution of Equation (3.36) can always be multiplied by
an arbitrary constant to obtain another solution, so we always have the freedom
to normalize the wave function by the appropriate choice of the multiplicative
constant.

In Equation (3.36), the potential V(r) will generally be determined by whatever
system is under consideration. But how do we determine E, the total energy of
the system? This is generally not something that is specified in advance. However,
for many potentials, it is not possible to find solutions for arbitrary choices of E;
rather, solutions exist only for certain discrete choices for E. This is the origin of
energy quantization.

To see how this arises, consider the one-dimensional version of Equation (3.36):

$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + V(x)\psi = E\psi$$

where ψ is a function of x. This equation can be rewritten as

$$\frac{d^2\psi}{dx^2} = \frac{2m}{\hbar^2}\,[V(x) - E]\,\psi \tag{3.38}$$

Assume that we have a particular potential V(x) and E, and we want to solve
for ψ.

Equation (3.38) can be used to determine the sign of d²ψ/dx², which gives the
direction in which the wave function curves: if d²ψ/dx² > 0, then ψ(x) is concave
up, while if d²ψ/dx² < 0, then ψ(x) is concave down. Then for E < V(x), the
factor multiplying ψ on the right-hand side of Equation (3.38) will be positive,
so that d²ψ/dx² and ψ will have the same sign, either both positive or both
negative. If ψ > 0 and d²ψ/dx² > 0, then ψ lies above the x-axis and is concave
upward, while if ψ < 0 and d²ψ/dx² < 0, then ψ lies below the x-axis and is
concave downward. In either case, ψ(x) will curve away from the horizontal axis
(Figure 3.5).

On the other hand, if E > V(x) we can draw the opposite conclusion. In this
case, the factor multiplying ψ on the right-hand side of Equation (3.38) will be
negative, so that d²ψ/dx² and ψ will have opposite signs. Then when ψ lies above
the horizontal axis, the function ψ(x) will be concave downward, while if ψ lies
below the horizontal axis, it will be concave upward. In either case, ψ(x) will
curve toward the x-axis (Figure 3.6).

These different possibilities have a clear interpretation in the classical case. If
E < V(x) outside of a finite region of space, then the particle is bound in that
potential, while E > V(x) corresponds to an unbound particle. Consider a bound
particle first with the potential and energy shown in Figure 3.7.

A classical particle with the indicated energy will oscillate back and forth in the
region between −x_1 and x_1, i.e., in the region for which E > V(x). The velocity of
this particle is given by conservation of energy: mv²/2 = E − V(x). This velocity
reaches zero at x = −x_1 and x = x_1, and the particle moves back in the opposite
direction. Note that the particle can never enter the regions x > x_1 or x < −x_1,
because in these regions it would have negative kinetic energy, which is impossible.
Hence, this is called the classically forbidden region.

FIGURE 3.6 If d²ψ/dx² < 0 for ψ > 0, or d²ψ/dx² > 0 for ψ < 0, then ψ curves
toward the x-axis.

Now we can sketch a qualitative solution to the time-independent Schrodinger
equation using the behavior of ψ illustrated in Figures 3.5 and 3.6. Taking ψ > 0,
we note that ψ(x) will be concave down in the region −x_1 < x < x_1 and concave
up outside this region. One solution, therefore, will behave as in Figure 3.8. This
solution corresponds to the lowest-energy bound state, so it is called the
ground-state wave function.



FIGURE 3.7 A particle with energy E moves in the potential shown here. Classically,
the particle is bound in the region −x_1 < x < x_1.

FIGURE 3.8 A function ψ(x) corresponding to the energy and potential shown in
Figure 3.7. ψ(x) is concave down in the region −x_1 < x < x_1 and concave up on either side of
this region. This function does not cross the x-axis, and it corresponds to the ground state.


Now we see another stark contrast between the predictions of quantum mechanics
and classical mechanics. The function ψ is nonzero in the classically forbidden
region, so there is a nonzero probability to find the particle in this region. The
particle can penetrate into a region that classical mechanics predicts it does not
have enough energy to reach! This effect has far-reaching consequences, some of
which will be discussed in the next chapter.

Note further that ψ must be very finely tuned in the classically forbidden region.
When E < V(x), so that ψ(x) curves away from the x-axis, the function has a
tendency to “blow up” to positive or negative infinity. This is bad because then the
wave function cannot be normalized to give a meaningful probability density. Only
by choosing the amplitude just right will ψ(x) decline smoothly to zero, always
with positive curvature, as x goes to infinity. This “dangerous” behavior of the
wave function in the classically forbidden region has an important consequence.
Suppose we change the energy by a tiny amount, from E to E′. Then, if we choose
ψ to go to zero on the left-hand side, when x → −∞, it will have the “wrong” slope
when it enters the classically forbidden region on the right-hand side. Depending
on the actual solution we choose, the function will either blow up toward +∞ or
cross the x-axis and fall away to −∞ (Figure 3.9).

FIGURE 3.9 If the energy in Figure 3.7 is changed slightly, there is no longer a
well-behaved solution to the Schrodinger equation; the wave function blows up to either +∞
or −∞.

Does this mean that the time-independent Schrödinger equation has only a
single solution for this particular potential? No. If the energy is increased by a
large enough amount, a second acceptable solution for $\psi(x)$ is encountered, which
is shown in Figure 3.10. This state is called the first excited state, since it has the
lowest energy of any state beyond the ground state. (The next state is called the
second excited state, and so on.) Note further that $\psi(x)$ for the first excited state
crosses the $x$-axis exactly one time, while the ground state function does not cross
the $x$-axis at all. This corresponds to a general result for bound states: $\psi(x)$ for the
ground state crosses the $x$-axis zero times, $\psi(x)$ for the first excited state crosses
the $x$-axis one time, $\psi(x)$ for the second excited state crosses the $x$-axis two times,
and so on.

Thus, for this potential, the bound-state solutions of the Schrödinger equation
occur only at fixed, discrete values of the energy $E$. This is the origin of energy
quantization: for states which are classically bound, the time-independent
Schrödinger equation has solutions only at discrete values of the energy. So a
particle in a bound state is no longer free to have any energy at all; it can take on only
energies which correspond to solutions of the Schrödinger equation.
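
The fine tuning described in this section can be seen numerically. The following is a minimal sketch and not part of the original argument: it assumes units in which $\hbar = m = 1$, and it uses a harmonic potential $V(x) = x^2/2$ as a stand-in for the potential of Figure 3.7. The names `harmonic`, `shoot`, and `find_energy` are illustrative. The code integrates the time-independent Schrödinger equation from the far left and looks at $\psi$ on the far right; only close to an allowed energy does $\psi$ fail to blow up.

```python
import numpy as np

def harmonic(x):
    """Illustrative potential V(x) = x^2 / 2 (an assumption, not Figure 3.7)."""
    return 0.5 * x * x

def shoot(E, V=harmonic, x_max=6.0, n=4001):
    """Integrate psi'' = 2 (V - E) psi from -x_max to +x_max (hbar = m = 1).

    psi starts at zero on the far left with a tiny positive slope.
    Returns psi at x = +x_max; a large magnitude means the solution blew up.
    """
    x = np.linspace(-x_max, x_max, n)
    h = x[1] - x[0]
    psi_prev, psi = 0.0, 1e-6
    for i in range(1, n - 1):
        # central-difference form of the Schrodinger equation
        psi_next = 2.0 * psi - psi_prev + h * h * 2.0 * (V(x[i]) - E) * psi
        psi_prev, psi = psi, psi_next
    return psi

def find_energy(E_low, E_high, steps=50):
    """Bisect on the sign of psi(+x_max) to locate an allowed energy."""
    f_low = shoot(E_low)
    for _ in range(steps):
        E_mid = 0.5 * (E_low + E_high)
        f_mid = shoot(E_mid)
        if f_low * f_mid > 0:
            E_low, f_low = E_mid, f_mid
        else:
            E_high = E_mid
    return 0.5 * (E_low + E_high)

if __name__ == "__main__":
    for E in (0.4, 0.6):
        print(f"E = {E}: psi at right edge = {shoot(E):.3e}")
    print("estimated ground-state energy:", find_energy(0.4, 0.6))
```

For trial energies on either side of an allowed value, $\psi$ diverges with opposite signs at the right edge, which is the behavior sketched in Figure 3.9. Bisecting on that sign change is one simple way to home in on the allowed energy; for this assumed potential the exact ground-state energy in these units is 1/2.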

Now consider what happens for an unbound particle. Using the same potential, 
we consider the behavior of the particle when its energy is larger than the potential 








FIGURE 3.10 A second function $\psi(x)$ corresponding to the potential shown in Figure 3.7,
with a larger energy than displayed in that figure. $\psi(x)$ is concave up for $x < -x_1$ and
$0 < x < x_1$, and concave down for $-x_1 < x < 0$ and $x > x_1$. This function crosses the
$x$-axis once and corresponds to the first excited state.



FIGURE 3.11 A particle with energy $E$ moves in the potential shown here. Since $E >
V(x)$ everywhere, the particle is not bound in this potential.


everywhere (Figure 3.11). In this case, $V(x) - E < 0$, so $\psi(x)$ curves toward the
$x$-axis everywhere. Now $\psi$ can no longer “blow up,” since it never curves away
from the $x$-axis. Therefore, if $E$ is changed slightly, the Schrödinger equation still
yields an acceptable solution for $\psi(x)$. So while the bound states are quantized,
the unbound states are not; they can have any energy (larger than $V(x)$) and are
not restricted to a set of discrete energies. Note also that a single potential, such
as the one discussed here, can have both bound and unbound states.








EXERCISES 

3.1 A particle of mass $m$ is moving in one dimension in a potential $V(x, t)$. The wave
function for the particle is

$$\Psi(x, t) = A x\, e^{-(\sqrt{km}/2\hbar)x^2}\, e^{-i\sqrt{k/m}\,(3/2)t}$$

for $-\infty < x < +\infty$, where $k$ and $A$ are constants.

(a) Show that $V$ is independent of $t$, and determine $V(x)$.

(b) Normalize this wave function.

(c) Using the normalized wave function, calculate $\langle x \rangle$, $\langle x^2 \rangle$, $\langle p \rangle$, and $\langle p^2 \rangle$.

3.2 Determine which of the following one-dimensional wave functions represent states of 
definite momentum. For each wave function that does correspond to a state of definite 
momentum, determine the momentum. 

(a) $\psi(x) = e^{ikx}$

(b) $\psi(x) = x e^{ikx}$

(c) $\psi(x) = \sin(kx) + i\cos(kx)$

(d) $\psi(x) = e^{ikx} + e^{-ikx}$

3.3 The wave function for a particle is $\Psi(x, t) = \sin(kx)[i\cos(\omega t/2) + \sin(\omega t/2)]$,
where $k$ and $\omega$ are constants.

(a) Is this particle in a state of definite momentum? If so, determine the momentum. 

(b) Is this particle in a state of definite energy? If so, determine the energy. 

3.4 A particle with mass $m$ is moving in one dimension near the speed of light so that the
relation

$$E = p^2/2m$$

for the kinetic energy is no longer valid. Instead, the total energy is given by

$$E^2 = p^2c^2 + m^2c^4$$

Hence, we can no longer use the Schrödinger equation. Suppose the wave function
$\Psi(x, t)$ for the particle is an eigenfunction of the energy operator and an eigenfunction
of the momentum operator, and also assume that there is no potential energy $V$. Derive
a linear differential equation for $\Psi(x, t)$.

3.5 A particle with mass $m$ is moving in one dimension in the potential $V(x)$. The particle
is in a state of definite energy $E$, but it is not in a state of definite momentum $p$. Show
that

$$\frac{\langle p^2 \rangle}{2m} + \langle V(x) \rangle = E$$

3.6 Consider the solution to the Schrödinger equation for the infinite square well with
$n = 2$ rather than $n = 1$ in Equation (3.20). Derive $\Psi(x, t)$ for this case, and normalize
this wave function.

3.7 Suppose that a wave function $\Psi(x, t)$ is normalized. Show that the wave function
$e^{i\theta}\Psi(x, t)$, where $\theta$ is an arbitrary real number, is also normalized.





3.8 Suppose that $\psi_1$ and $\psi_2$ are two different solutions of the time-independent Schrödinger
equation with the same energy $E$.

(a) Show that $\psi_1 + \psi_2$ is also a solution with energy $E$.

(b) Show that $c\psi_1$ is also a solution of the Schrödinger equation with energy $E$.

3.9 A particle moves in one dimension in the potential shown here. The energy E is shown 
on the graph, and the particle is in its ground state. 



(a) Sketch $\psi(x)$ for this particle.

(b) You make a measurement to find the particle. Indicate on your graph the point or 
points at which you are most likely to find it. 

3.10 A particle moves in one dimension in the potential shown here. The energy E is shown 
on the graph, and the particle is in its first excited state. 



(a) Sketch $\psi(x)$ for this particle.

(b) You make a measurement to find the particle. Indicate on your graph the point or 
points at which you are most likely to find it. 

3.11 A particle moving in one dimension is described by the function $\psi(x)$ shown here:



(a) You make a measurement to locate the particle. Which one of the following is 
true? 

i. You will always find the particle at point B. 

ii. You are most likely to find the particle at points A or C and least likely to find 
the particle at point B. 








CHAPTER

4


Solutions of the One-Dimensional
Time-Independent Schrödinger
Equation


In this chapter we will examine some exact solutions of the one-dimensional
time-independent Schrödinger equation,

$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + V(x)\psi = E\psi \tag{4.1}$$


Of course, the real world is three-dimensional, but Equation (4.1) can be applied
whenever a particle moves only in a single direction. Consider, for example,
an electron travelling through an evacuated tube (Figure 4.1) with an electrostatic
potential $\Phi(x)$. Since the electron has charge $-e$, it experiences the
potential $V(x) = -e\Phi(x)$, and this system can be treated as effectively
one-dimensional. Another example is the motion of electrons in semiconductor
heterostructures. These are materials formed by joining together two or more
different semiconductors. At the junction between the materials, the electron
experiences a change in the potential, and it is possible to construct materials with
potentials that mimic some of those considered in this chapter.

Classically, there are two possible types of states: bound states, in which the
particle is confined to move in a finite region, and unbound states, in which the
particle can escape to infinity. Because the solutions for unbound states are quite 
different from those of bound states, we will treat the two cases separately. For 
unbound states, we will calculate the probability that an incident particle will 
reflect off of or transmit through a particular potential. For bound states, we will 
calculate the wave functions and energies. As noted in the previous chapter, the 
bound state energy levels will, in general, be discrete rather than continuous. 


4.1 ■ UNBOUND STATES: SCATTERING AND TUNNELING 

In this section we will consider only the case of piecewise constant potentials, i.e.,
potentials with step-like discontinuities (Figure 4.2). Consider a range in $x$ over
which $V(x)$ is a constant, $V_0$. Then Equation (4.1) becomes

$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + V_0\psi = E\psi$$




FIGURE 4.1 An electron moving in one dimension in the potential $V(x) = -e\Phi(x)$.


which can be rewritten as

$$\frac{d^2\psi}{dx^2} + \frac{2m(E - V_0)}{\hbar^2}\psi = 0 \tag{4.2}$$


This is the simplest possible version of the time-independent Schrödinger equation.
To find a solution, consider the general equation of the form

$$\frac{d^2y}{dx^2} + Ay = 0 \tag{4.3}$$


where $A$ is an arbitrary constant. We try a solution of the form

$$y = Ce^{mx}$$

where both $C$ and $m$ are constants. Then

$$\frac{d^2y}{dx^2} = Cm^2e^{mx}$$

Substituting these expressions for $d^2y/dx^2$ and $y$ into Equation (4.3), we obtain

$$m^2Ce^{mx} + ACe^{mx} = 0$$








FIGURE 4.2 An example of a piecewise constant potential.

which is satisfied as long as $m$ is chosen so that $m^2 + A = 0$, while $C$ can have
any value at all. Thus, $m = \pm\sqrt{-A}$, and the general solution is

$$y = C_1 e^{\sqrt{-A}\,x} + C_2 e^{-\sqrt{-A}\,x} \tag{4.4}$$

where $C_1$ and $C_2$ are arbitrary constants. (Note that $C_1$ and $C_2$ need not be real;
they can also be complex numbers.)

This solution will have quite different behavior depending on whether $A$ is
positive or negative. If $A$ is negative, then the quantities under the square roots
will be positive, and the solution given by Equation (4.4) will be the sum of a
positive and a negative exponential. On the other hand, if $A$ is positive, then $\sqrt{-A}$
will be imaginary. In this case we take $\sqrt{-A} = i\sqrt{A}$, and Equation (4.4) becomes

$$y = C_1 e^{i\sqrt{A}\,x} + C_2 e^{-i\sqrt{A}\,x} \tag{4.5}$$

Note that this solution can also be written as a sum of trigonometric functions (see
Exercise 4.1):

$$y = D_1\cos(\sqrt{A}\,x) + D_2\sin(\sqrt{A}\,x) \tag{4.6}$$

The solutions given by Equations (4.5) and (4.6) are completely equivalent, so the
question of which one to use is a matter of convenience; it will usually be easier
to use one form or the other depending on the boundary conditions in the problem
at hand.
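
As a quick sanity check, the general solution of Equation (4.3) can be tested numerically for both signs of $A$. The sketch below is illustrative only: the function names `general_solution` and `residual`, the sample constants, and the finite-difference step are assumptions chosen for the demonstration. It evaluates $d^2y/dx^2 + Ay$ with a central difference and confirms that the result is close to zero whether the exponent $\sqrt{-A}$ is real or imaginary.

```python
import cmath

def general_solution(A, C1, C2):
    """Return y(x) = C1 e^{sqrt(-A) x} + C2 e^{-sqrt(-A) x} as a function."""
    root = cmath.sqrt(-A)  # real for A < 0, imaginary for A > 0
    return lambda x: C1 * cmath.exp(root * x) + C2 * cmath.exp(-root * x)

def residual(y, A, x, h=1e-4):
    """|y'' + A y| at x, with y'' from a central finite difference."""
    second = (y(x + h) - 2.0 * y(x) + y(x - h)) / (h * h)
    return abs(second + A * y(x))

if __name__ == "__main__":
    for A in (4.0, -4.0):
        y = general_solution(A, C1=1.0, C2=0.5j)  # complex constants are allowed
        print(f"A = {A:+.1f}: |y'' + A y| at x = 0.3 is {residual(y, A, 0.3):.2e}")
```

The residual is limited only by the finite-difference approximation, so a small nonzero value is expected rather than an exact zero.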












FIGURE 4.3 Both $\psi(x)$ and $d\psi(x)/dx$ must be continuous.


For a particle with a given energy $E$ moving in a piecewise constant potential
with step discontinuities such as in Figure 4.2, we will obtain different solutions
in the regions with different values of $V_0$. The constants appearing in the solutions
must then be chosen so that the wave functions “join up” at each step, i.e., both
$\psi(x)$ and $d\psi(x)/dx$ must be continuous (Figure 4.3).

We can now apply this set of solutions to the Schrödinger equation in the form
of Equation (4.2). Consider first the simplest case, where $V_0 = 0$, so we are dealing
with a “free particle” and no potential. In this case we clearly have $E - V_0 > 0$,
so we choose a solution of the form given in Equation (4.5) or (4.6). The former
will be more convenient for our current purposes; we obtain

$$\psi = C_1 e^{i(\sqrt{2mE}/\hbar)x} + C_2 e^{-i(\sqrt{2mE}/\hbar)x}$$

The physical interpretation of this solution is clearer if we use it to derive the
time-dependent wave function,

$$\Psi(x, t) = \psi(x)e^{-iEt/\hbar} = C_1 e^{i[(\sqrt{2mE}/\hbar)x - Et/\hbar]} + C_2 e^{i[-(\sqrt{2mE}/\hbar)x - Et/\hbar]} \tag{4.7}$$


Now recall that energy is related to momentum via $E = p^2/2m$, so that $\sqrt{2mE} =
p$. Furthermore, in the previous chapter we derived relations between momentum






and wavenumber ($p = \hbar k$) and between energy and frequency ($E = \hbar\omega$).
Substituting all of these into Equation (4.7), we get

$$\Psi(x, t) = C_1 e^{i(kx - \omega t)} + C_2 e^{i(-kx - \omega t)} \tag{4.8}$$


We have seen wave functions of this form already in Chapter 3; the function
$e^{i(kx - \omega t)}$ represents a wave moving in the $+x$ direction (i.e., to the right in a
conventional coordinate system) with energy $\hbar\omega$ and momentum $\hbar k$, while the
second term represents a particle moving to the left with energy $\hbar\omega$ and momentum
$-\hbar k$. We know that Equation (4.8) is an eigenfunction of the energy operator, since
it was derived using the time-independent Schrödinger equation. Now consider
what happens if we apply the momentum operator to each term separately:

$$-i\hbar\frac{\partial}{\partial x}\, C_1 e^{i(kx - \omega t)} = \hbar k\, C_1 e^{i(kx - \omega t)}$$

$$-i\hbar\frac{\partial}{\partial x}\, C_2 e^{i(-kx - \omega t)} = -\hbar k\, C_2 e^{i(-kx - \omega t)}$$

As expected, the first term in our wave function, which represents a rightward
moving particle, is an eigenfunction of the momentum operator with momentum
$\hbar k$. Similarly, the second term is an eigenfunction of the momentum operator with
momentum $-\hbar k$. On the other hand, it is easy to verify that the total wave function
(if $C_1 \neq 0$ and $C_2 \neq 0$) is not an eigenfunction of the momentum operator (see
Exercise 4.2).
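
The eigenfunction statements above can be illustrated numerically. This is a hedged sketch rather than a proof: the function name `momentum_ratio`, the sample points, and the choice $\hbar = 1$, $k = 2$ are assumptions made for the example, and the time factor is dropped because it cancels in the ratio. The code applies $-i\hbar\, d/dx$ to $C_1 e^{ikx} + C_2 e^{-ikx}$ and divides by the wave function; the ratio is the same at every $x$ only when the function is a momentum eigenfunction.

```python
import numpy as np

def momentum_ratio(C1, C2, k=2.0, hbar=1.0):
    """Return (p psi) / psi at three sample points for psi = C1 e^{ikx} + C2 e^{-ikx}."""
    x = np.array([0.1, 0.4, 0.9])
    psi = C1 * np.exp(1j * k * x) + C2 * np.exp(-1j * k * x)
    # analytic derivative of psi with respect to x
    dpsi = 1j * k * (C1 * np.exp(1j * k * x) - C2 * np.exp(-1j * k * x))
    return -1j * hbar * dpsi / psi

if __name__ == "__main__":
    print("rightward only:", momentum_ratio(1.0, 0.0))
    print("leftward only: ", momentum_ratio(0.0, 1.0))
    print("superposition: ", momentum_ratio(1.0, 1.0))
```

For a single term the ratio is the constant $\pm\hbar k$; for the superposition it changes from point to point, so no single momentum value can be assigned.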

It might seem puzzling that the general form for the wave function of a free
particle moving in one dimension consists of pieces representing particles
simultaneously moving both to the left and to the right. Consider, however, a
particle reflecting off of a boundary. The wave function in this case simultaneously
represents both the rightward-moving incident particle and the leftward-moving
reflected particle. Unlike classical mechanics, where the motion of the particle is
specified by its position as a function of time $x(t)$, and $x$ simply tracks the particle
as it first moves to the right and later moves to the left, the quantum mechanical
wave function $\Psi$ simultaneously encodes the particle moving to the right and
reflecting back to the left.

The constants $C_1$ and $C_2$ in Equation (4.8) are free parameters which can be
chosen to match the physical system. Hence, a free particle moving purely to the
right can be expressed in terms of Equation (4.8) by setting $C_2 = 0$. Similarly, a
free particle moving to the left has a wave function given by Equation (4.8) with
$C_1 = 0$.

What do we know about the position of the particle? Consider the wave function
for a purely rightward-moving particle,

$$\Psi(x, t) = C_1 e^{i(kx - \omega t)} \tag{4.9}$$








FIGURE 4.4 $\Psi^*\Psi$ for a wave packet generated by summing over all waves in the interval
$k = 0$ to $k = k_0$, where each wave is given equal amplitude.


Calculating $\Psi^*\Psi$ gives

$$\Psi^*\Psi = C_1^* e^{-i(kx - \omega t)}\, C_1 e^{i(kx - \omega t)} = C_1^* C_1$$


which is independent of position. This means that the particle is equally likely to
be found anywhere in space! In fact, Equation (4.9) represents an idealization. In
theory, a particle which is in an exact momentum state will be spread out over
an arbitrarily large distance, but in practice, a physical particle will correspond
to a sum of waves like those in Equation (4.9), each with a slightly different
momentum and energy (or, equivalently, a slightly different $k$ and $\omega$). These waves
can be summed to produce a value for $\Psi^*\Psi$ which is localized in space. Such a
sum of waves is called a wave packet, and it can be normalized. For example, in
Figure 4.4, $\Psi^*\Psi$ is shown for a wave packet generated by summing over all waves
in the interval $k = 0$ to $k = k_0$, where each wave is given equal amplitude.
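
The localization produced by summing plane waves can be reproduced with a short calculation. The following is a minimal sketch under stated assumptions: it works at $t = 0$ (so the frequencies drop out), sets $k_0 = 1$ in arbitrary units, leaves the packet unnormalized, and uses the illustrative names `packet_density` and `analytic_density`. It sums equal-amplitude waves $e^{ikx}$ over $0 \le k \le k_0$ and compares $\Psi^*\Psi$ with the closed form of the same integral, $[\sin(k_0 x/2)/(x/2)]^2$.

```python
import numpy as np

def packet_density(x, k0=1.0, n_waves=1001):
    """|Psi(x, 0)|^2 for equal-amplitude plane waves with 0 <= k <= k0."""
    k = np.linspace(0.0, k0, n_waves)
    dk = k[1] - k[0]
    weights = np.full(n_waves, dk)
    weights[0] = weights[-1] = 0.5 * dk  # trapezoid rule for the k integral
    psi = np.exp(1j * np.outer(x, k)) @ weights
    return np.abs(psi) ** 2

def analytic_density(x, k0=1.0):
    """Closed form [sin(k0 x / 2) / (x / 2)]^2 of the same integral."""
    # np.sinc(z) is sin(pi z) / (pi z), hence the factor of 2 pi
    return (k0 * np.sinc(k0 * x / (2.0 * np.pi))) ** 2

if __name__ == "__main__":
    x = np.linspace(-30.0, 30.0, 401)
    rho = packet_density(x)
    print("density is largest at x =", x[np.argmax(rho)])
    print("max difference from closed form:", np.max(np.abs(rho - analytic_density(x))))
```

Unlike the single plane wave of Equation (4.9), the summed density is peaked around one location and falls off away from it, so its integral over all $x$ is finite and the packet can be normalized.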

Scattering From Step-Function Potentials 

In solving the Schrödinger equation for scattering in one dimension, we will
idealize the situation and treat the particles as eigenfunctions of momentum; we will
still be able to derive physically interesting results for this case. Consider first the
“step-function” potential shown in Figure 4.5. The physical location of the step in
the potential is arbitrary, so we will take it to lie at $x = 0$. Similarly, as in classical
mechanics, we are free to choose the zero of the potential anywhere, so we will
take $V = 0$ for $x < 0$, while $V = V_0$ for $x > 0$.

We can distinguish two different cases here: either $E < V_0$, or $E > V_0$. In a
classical system, the behavior of the particle is easy to calculate. Consider first a
classical particle moving from left to right with energy $E < V_0$. The particle will
not be able to enter the region $x > 0$, and it will simply bounce back to the left
with the magnitude of its velocity unchanged. A classical particle with $E > V_0$ will






FIGURE 4.5 A step-function potential. 


penetrate into the region $x > 0$, but the magnitude of its velocity will decrease as
its momentum changes from $p_1 = \sqrt{2mE}$ on the left-hand side of the barrier to
$p_2 = \sqrt{2m(E - V_0)}$ on the right-hand side. Quantum mechanics predicts a very
different behavior, as we shall now see.


Step-Function Potential With $E > V_0$

Consider first the high-energy case for which a classical particle will simply travel
into the $x > 0$ region with reduced momentum. For this case, the Schrödinger
equation for a constant potential, Equation (4.2), has the form

$$\frac{d^2\psi}{dx^2} + \frac{2mE}{\hbar^2}\psi = 0 \tag{4.10}$$

for $x < 0$, and

$$\frac{d^2\psi}{dx^2} + \frac{2m(E - V_0)}{\hbar^2}\psi = 0 \tag{4.11}$$


for $x > 0$. Since both $E$ and $E - V_0$ are positive, our solution in both regions can
be expressed in the form of either Equation (4.5) or (4.6). Either form can be used
to solve the problem, but the solution will turn out to be easier to interpret using
exponentials (Equation 4.5) rather than trigonometric functions. The solution to
the Schrödinger equation in these two regions is

$$\psi_1(x) = A_1 e^{i(\sqrt{2mE}/\hbar)x} + B_1 e^{-i(\sqrt{2mE}/\hbar)x}, \quad x < 0$$

$$\psi_2(x) = A_2 e^{i(\sqrt{2m(E - V_0)}/\hbar)x} + B_2 e^{-i(\sqrt{2m(E - V_0)}/\hbar)x}, \quad x > 0$$







where $A_1$, $B_1$, $A_2$, and $B_2$ are constants which need to be determined from the
boundary conditions. To simplify the calculation, we make the substitutions

$$k_1 = \frac{\sqrt{2mE}}{\hbar}, \qquad k_2 = \frac{\sqrt{2m(E - V_0)}}{\hbar}$$

so that $\hbar k_1$ and $\hbar k_2$ give the magnitude of the momentum of the particle on the left
and right sides of the step, respectively. Then the solution becomes

$$\psi_1(x) = A_1 e^{ik_1x} + B_1 e^{-ik_1x}, \quad x < 0 \tag{4.12}$$

$$\psi_2(x) = A_2 e^{ik_2x} + B_2 e^{-ik_2x}, \quad x > 0 \tag{4.13}$$

As noted previously, the constants $A_1$, $B_1$, $A_2$, and $B_2$ must be chosen so that the
wave function and its derivative are continuous at the boundary between the two
solutions:

$$\psi_1(0) = \psi_2(0)$$

and

$$\frac{d\psi_1}{dx}(0) = \frac{d\psi_2}{dx}(0)$$


However, before applying these boundary conditions, it is useful to understand
the physical significance of the terms in Equations (4.12) and (4.13). The
time-dependent wave functions for the particle, derived from Equations (4.12) and
(4.13), are

$$\Psi_1(x, t) = A_1 e^{i(k_1x - Et/\hbar)} + B_1 e^{i(-k_1x - Et/\hbar)}, \quad x < 0 \tag{4.14}$$

$$\Psi_2(x, t) = A_2 e^{i(k_2x - Et/\hbar)} + B_2 e^{i(-k_2x - Et/\hbar)}, \quad x > 0 \tag{4.15}$$

Thus, the “$A_1$” term represents a rightward-moving particle on the left side of the
step, the “$B_1$” term represents a leftward-moving particle on the left side of the step,
the “$A_2$” term represents a rightward-moving particle on the right side of the step,
and the “$B_2$” term represents a leftward-moving particle on the right side of the
step (Figure 4.6). However, not all of these terms make physical sense. Assuming
that the particle is initially in the region $x < 0$, travelling to the right, we expect
$A_1 \neq 0$. Classically the particle will travel across the step and continue travelling
to the right, so we expect $A_2 \neq 0$. We also want to allow for the possibility that
the particle can scatter backwards off of the step; although this cannot happen
classically, it cannot be ruled out in a quantum system, so we must allow for
$B_1 \neq 0$. However, one term makes no sense at all: the “$B_2$” term represents a
particle originating at $x = +\infty$ and moving to the left. There is no way to produce
such a particle trajectory from a particle initially moving to the right on the
left-hand side of the step. Therefore, on physical grounds, we set $B_2 = 0$.






FIGURE 4.6 In Equations (4.14) and (4.15), the “$A_1$” term represents a rightward-moving
particle on the left side of the step, the “$B_1$” term represents a leftward-moving particle on
the left side of the step, the “$A_2$” term represents a rightward-moving particle on the right
side of the step, and the “$B_2$” term represents a leftward-moving particle on the right side
of the step.


Then our wave functions simplify to

$$\psi_1(x) = A_1 e^{ik_1x} + B_1 e^{-ik_1x}, \quad x < 0$$

$$\psi_2(x) = A_2 e^{ik_2x}, \quad x > 0 \tag{4.16}$$

Using our two boundary conditions at $x = 0$, we find the requirement that $\psi_1(0) =
\psi_2(0)$ gives

$$A_1 + B_1 = A_2 \tag{4.17}$$

while the requirement that $d\psi_1/dx = d\psi_2/dx$ at $x = 0$ yields

$$ik_1A_1 - ik_1B_1 = ik_2A_2 \tag{4.18}$$

Since $k_1$ and $k_2$ are specified by the values of $E$ and $V_0$, we have three unknowns,
$A_1$, $B_1$, and $A_2$, and only two equations constraining them, Equations (4.17) and
(4.18). Hence, we cannot solve for $A_1$, $B_1$, and $A_2$, but we can express two of
the unknowns in terms of the third. Keeping $A_2$ as our only unknown, we get the





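solution

$$A_1 = \frac{A_2}{2}\left(1 + \frac{k_2}{k_1}\right) \tag{4.19}$$

and

$$B_1 = \frac{A_2}{2}\left(1 - \frac{k_2}{k_1}\right) \tag{4.20}$$

so that the wave functions become

$$\psi_1(x) = \frac{A_2}{2}\left(1 + \frac{k_2}{k_1}\right)e^{ik_1x} + \frac{A_2}{2}\left(1 - \frac{k_2}{k_1}\right)e^{-ik_1x}, \quad x < 0$$

$$\psi_2(x) = A_2 e^{ik_2x}, \quad x > 0$$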

This is the complete solution, but what does it mean physically? Recall that for a
normalized wave function, the probability of finding a particle at a given location
$x$ is given by $\psi^*(x)\psi(x)$. Our wave functions are not so well behaved, since
they represent an idealized, single-momentum state. However, it is still possible to
derive useful results from them. In particular, we can define the probabilities that
the particle will be reflected at the step ($R$) and that it will be transmitted across
the step ($T$). In analogy to results from classical electromagnetism, we take

$$R = \frac{(\text{reflected amplitude})^2}{(\text{incident amplitude})^2}$$

and

$$T = \frac{(\text{transmitted amplitude})^2}{(\text{incident amplitude})^2}$$

We require that $R + T = 1$, since the particle must be either transmitted or
reflected. Since we are dealing with complex quantities, the square of the incident
amplitude is $A_1^*A_1$, and the square of the reflected amplitude is $B_1^*B_1$, so

$$R = \frac{B_1^*B_1}{A_1^*A_1} \tag{4.21}$$

(Note that in general, the terms giving the transmitted and reflected amplitudes
will also include a factor depending on the velocity of the particle, but these
factors cancel in Equation (4.21), since they are evaluated in the same region.) The
transmission probability is then simply

$$T = 1 - R$$








Substituting our amplitudes from Equations (4.19) and (4.20) into Equation (4.21)
gives us the reflection probability

$$R = \frac{(k_1 - k_2)^2}{(k_1 + k_2)^2} \tag{4.22}$$

while

$$T = 1 - R = \frac{4k_1k_2}{(k_1 + k_2)^2}$$

This is a truly remarkable result; it tells us that there is a nonzero probability that the
particle will not be transmitted across the step but will actually reflect backwards!
The probability of reflection (Equation 4.22) can be rewritten in terms of $E$ and
$V_0$:

$$R = \left(\frac{\sqrt{E} - \sqrt{E - V_0}}{\sqrt{E} + \sqrt{E - V_0}}\right)^2$$

It is reasonable to ask why such reflection is not observed in classical systems.
For example, suppose we have an electron moving through a region of space in
which the electric potential changes abruptly. If we take the electron to have an
energy equal to twice the potential step, $E = 2V_0$, we obtain $R \approx 0.03$, a
nonnegligible reflection probability. The answer lies in our assumption that the potential
is an infinitely sharp step. In any real physical system, the step-function potential
will have a nonzero width in the $x$ direction. As long as this width is much larger
than the de Broglie wavelength of the scattering particle, the system will lie in the
classical regime, and the calculation we have just performed will be invalid.
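
The reflection and transmission probabilities for the sharp step are easy to evaluate directly. The sketch below is illustrative: the function name `step_probabilities` is an assumption, and the code uses the fact that only the ratio $k_2/k_1 = \sqrt{(E - V_0)/E}$ enters Equation (4.22), so the mass and $\hbar$ never appear.

```python
import math

def step_probabilities(E, V0):
    """Reflection and transmission probabilities for a sharp step with E > V0 > 0.

    Only the ratio k2/k1 = sqrt((E - V0)/E) matters, so m and hbar drop out.
    """
    if E <= V0:
        raise ValueError("this formula assumes E > V0")
    ratio = math.sqrt((E - V0) / E)            # k2 / k1
    R = ((1.0 - ratio) / (1.0 + ratio)) ** 2   # (k1 - k2)^2 / (k1 + k2)^2
    T = 4.0 * ratio / (1.0 + ratio) ** 2       # 4 k1 k2 / (k1 + k2)^2
    return R, T

if __name__ == "__main__":
    for energy in (1.01, 2.0, 10.0):           # energies in units of V0
        R, T = step_probabilities(energy, 1.0)
        print(f"E = {energy:5.2f} V0: R = {R:.4f}, T = {T:.4f}, R + T = {R + T:.4f}")
```

Evaluating Equation (4.22) at $E = 2V_0$ gives $R = (3 - 2\sqrt{2})^2 \approx 0.029$; the amplitude ratio $B_1/A_1$ itself is about 0.17. The reflection probability approaches 1 as $E$ drops toward $V_0$ and falls toward 0 at high energy.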


Example 4.1. When Is a Step-Function Potential in the Quantum Regime? 

As an example, consider an electron accelerated through a 100 V potential
difference. How narrow would the step-function potential need to be in order to observe
quantum effects?

The energy of the electron in this case is

$$E = e\Phi = (1.6 \times 10^{-19}\ \text{C})(100\ \text{V}) = 1.6 \times 10^{-17}\ \text{J}$$







Its de Broglie wavelength is then

$$\lambda = h/p = h/\sqrt{2mE} = \frac{6.6 \times 10^{-34}\ \text{J s}}{\sqrt{(2)(9.1 \times 10^{-31}\ \text{kg})(1.6 \times 10^{-17}\ \text{J})}} = 1.2 \times 10^{-10}\ \text{m}$$

Thus, the step in the potential would need to rise from 0 to $V_0$ in a length less
than $10^{-10}$ m, about an atomic radius. In practice, the step width would need to be
much larger than this in order for the system to lie in the purely classical regime.

In general, quantum effects in step potentials can only be seen at the atomic or
nuclear level. An important example ($\alpha$ decay) will be discussed below.
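
The arithmetic of Example 4.1 can be reproduced in a few lines, which also makes it easy to try other accelerating voltages. This is a small sketch using the rounded constants quoted in the example; the function name `de_broglie_wavelength` is an assumption, and the formula is the nonrelativistic one used above.

```python
import math

H = 6.6e-34     # Planck's constant in J s (rounded value from the example)
M_E = 9.1e-31   # electron mass in kg
Q_E = 1.6e-19   # elementary charge in C

def de_broglie_wavelength(voltage, mass=M_E, charge=Q_E):
    """Wavelength h / sqrt(2 m E) for a particle accelerated from rest
    through `voltage` volts (nonrelativistic)."""
    energy = charge * voltage
    return H / math.sqrt(2.0 * mass * energy)

if __name__ == "__main__":
    for volts in (1.0, 100.0, 10000.0):
        print(f"{volts:8.0f} V: wavelength = {de_broglie_wavelength(volts):.2e} m")
```

Because the wavelength scales as the inverse square root of the voltage, raising the energy only shortens it further, so the potential would have to change over an even smaller distance for the sharp-step result to apply.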

Step-Function Potential With $E < V_0$

Now consider scattering from a step-function potential when the energy of the
incident particle is less than the height of the step. Again, the Schrödinger equation will
be given by Equations (4.10) and (4.11), but now $E - V_0$ will be negative. Hence,
in the region $x < 0$, the solution to the Schrödinger equation will be identical to
what we obtained in the previous section:

$$\psi_1(x) = A_1 e^{i(\sqrt{2mE}/\hbar)x} + B_1 e^{-i(\sqrt{2mE}/\hbar)x}$$

However, in the region $x > 0$, we expect a solution of the form given by Equation
(4.4), i.e., a solution which looks like

$$\psi_2(x) = A_2 e^{(\sqrt{2m(V_0 - E)}/\hbar)x} + B_2 e^{-(\sqrt{2m(V_0 - E)}/\hbar)x}$$

Note that the quantity under the square root in the exponential is positive (since
$V_0 - E > 0$), so the exponentials are real. As in the previous section, we define

$$k_1 = \frac{\sqrt{2mE}}{\hbar}, \qquad k_2 = \frac{\sqrt{2m(V_0 - E)}}{\hbar}$$

As before, $\hbar k_1$ is the magnitude of the momentum of the particle on the left side
of the step, but $\hbar k_2$ does not have a similar physical significance. Now we write
the solution as

$$\psi_1(x) = A_1 e^{ik_1x} + B_1 e^{-ik_1x}, \quad x < 0$$

$$\psi_2(x) = A_2 e^{k_2x} + B_2 e^{-k_2x}, \quad x > 0$$
As in the previous section, we can make a physical argument to eliminate one
of the unknown constants. Note that if $A_2 \neq 0$, the wave function $\psi_2(x)$ will





“blow up” in the limit where $x \to \infty$. This problem can only be avoided by taking
$A_2 = 0$, yielding

$$\psi_2(x) = B_2 e^{-k_2x}, \quad x > 0$$

The requirement that $\psi_1 = \psi_2$ at $x = 0$ gives

$$A_1 + B_1 = B_2$$

while the requirement that $d\psi_1/dx = d\psi_2/dx$ at $x = 0$ gives

$$ik_1A_1 - ik_1B_1 = -k_2B_2$$

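These two equations can be solved to express $A_1$ and $B_1$ as functions of $B_2$:

$$A_1 = \frac{B_2}{2}\left(1 + i\frac{k_2}{k_1}\right), \qquad B_1 = \frac{B_2}{2}\left(1 - i\frac{k_2}{k_1}\right)$$

and the full wave function becomes

$$\psi_1(x) = \frac{B_2}{2}\left(1 + i\frac{k_2}{k_1}\right)e^{ik_1x} + \frac{B_2}{2}\left(1 - i\frac{k_2}{k_1}\right)e^{-ik_1x}, \quad x < 0$$

$$\psi_2(x) = B_2 e^{-k_2x}, \quad x > 0$$

As in the previous section, we can calculate the transmission and reflection
probabilities. The probability of reflection is

$$R = \frac{B_1^*B_1}{A_1^*A_1} = \frac{(B_2^*B_2/4)(1 + ik_2/k_1)(1 - ik_2/k_1)}{(B_2^*B_2/4)(1 - ik_2/k_1)(1 + ik_2/k_1)} = 1$$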

and the transmission probability is $T = 1 - R = 0$. In this case, the quantum
mechanical calculation produces a result in agreement with the classical result:
the particle will always reflect back at the step boundary.

However, there is one strange result here that does not agree with the classical
calculation. In the classical case, the particle stops exactly at the step and
reflects back. However, in our quantum mechanical calculation, the wave function
is nonzero for $x > 0$. This indicates that the probability of finding the particle in






FIGURE 4.7 A potential with a step of finite width.


the classically forbidden region ($x > 0$) is nonzero. This penetration into the
classically forbidden region is a purely quantum mechanical effect that has no analog
in classical mechanics. It is so small as to be unobservable at the macroscopic
scale, but it can have important consequences at the atomic and nuclear scales, as
shown in the next section.

An important limiting case of our solution occurs in the case of an “infinitely
high” potential barrier. Of course, no physical potential can be infinite, but this
simply means that $V_0$ is much larger than any typical particle energies: $E \ll V_0$.
In this limit, $k_2 \to \infty$, and $\psi_2(x)$ becomes negligible for any $x > 0$. In this limit,
therefore, it is a good approximation to take as a boundary condition $\psi(x) = 0$ at
$x = 0$.
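
Both results of this section, total reflection and a decaying tail inside the step, can be checked with numbers. The following sketch is illustrative only: the function name `reflection_below_step`, the electron energies, and the term "penetration depth" for the length $1/k_2$ are assumptions introduced here. The amplitude ratio $B_1/A_1 = (1 - ik_2/k_1)/(1 + ik_2/k_1)$ follows from the two matching conditions at $x = 0$.

```python
import math

HBAR = 1.05e-34   # J s
M_E = 9.1e-31     # electron mass in kg
EV = 1.6e-19      # joules per electron volt

def reflection_below_step(E, V0, m=M_E):
    """Return (R, penetration_depth) for a sharp step with 0 < E < V0.

    R is |B1/A1|^2 with B1/A1 = (1 - i k2/k1) / (1 + i k2/k1).
    penetration_depth = 1/k2 is the distance over which psi_2 falls by a factor e.
    """
    k1 = math.sqrt(2.0 * m * E) / HBAR
    k2 = math.sqrt(2.0 * m * (V0 - E)) / HBAR
    amplitude_ratio = complex(1.0, -k2 / k1) / complex(1.0, k2 / k1)
    return abs(amplitude_ratio) ** 2, 1.0 / k2

if __name__ == "__main__":
    for V0 in (2.0, 10.0, 1000.0):   # step heights in eV, with E fixed at 1 eV
        R, depth = reflection_below_step(1.0 * EV, V0 * EV)
        print(f"V0 = {V0:7.1f} eV: R = {R:.6f}, 1/k2 = {depth:.2e} m")
```

The reflection probability is 1 for every step height, while $1/k_2$ shrinks as $V_0$ grows. That shrinking tail is the numerical counterpart of the statement that $\psi$ may be set to zero at the edge of an infinitely high barrier.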

Tunneling 

In the previous section, we saw that the wave function can penetrate into the
classically forbidden region of a step-function potential. Now we examine what
happens if the step has finite width. Consider the step potential shown in Figure 4.7.
This potential has $V = 0$ for $x < 0$ and $x > a$, and $V = V_0$ for $0 < x < a$. We
assume that a particle is incident from the left with energy $E < V_0$. In classical
physics, such a particle will always bounce off of the barrier and reflect back to the
left. However, quantum mechanics makes a very different prediction: the particle
will sometimes be able to tunnel completely through the barrier and emerge on the
other side!






To see why this happens, recall our qualitative solutions to the Schrödinger
equation from the previous chapter. We expect the wave function to oscillate in the
regions $x < 0$ and $x > a$ but to curve away from the horizontal axis for $0 < x < a$.
We saw that for an infinitely wide step, the particle penetrates into the classically
forbidden region. If the potential is reduced to zero after a finite distance, as in
Figure 4.7, the wave function spills out onto the other side, where it oscillates. The
magnitude of this effect can be calculated using the Schrödinger equation.

This potential yields three distinct solutions to the Schrödinger equation,
corresponding to the three regions $x < 0$, $0 < x < a$, and $x > a$. As in the previous
section, we get

$$\psi_1(x) = A_1 e^{ik_1x} + B_1 e^{-ik_1x}, \quad x < 0$$

$$\psi_2(x) = A_2 e^{k_2x} + B_2 e^{-k_2x}, \quad 0 < x < a$$

with

$$k_1 = \frac{\sqrt{2mE}}{\hbar}, \qquad k_2 = \frac{\sqrt{2m(V_0 - E)}}{\hbar}$$

while the solution for $x > a$ is similar to the solution for $x < 0$, with the
leftward-moving part of the wave function deleted (as in Equation 4.16):

$$\psi_3(x) = B_3 e^{ik_1x}$$

Now we have two pairs of boundary conditions:

$$\psi_1(0) = \psi_2(0) \tag{4.23}$$

$$\frac{d\psi_1}{dx}(0) = \frac{d\psi_2}{dx}(0) \tag{4.24}$$

and

$$\psi_2(a) = \psi_3(a) \tag{4.25}$$

$$\frac{d\psi_2}{dx}(a) = \frac{d\psi_3}{dx}(a) \tag{4.26}$$

Equations (4.23)-(4.24) give

$$A_1 + B_1 = A_2 + B_2$$

$$ik_1A_1 - ik_1B_1 = k_2A_2 - k_2B_2$$

and Equations (4.25)-(4.26) give

$$A_2 e^{k_2a} + B_2 e^{-k_2a} = B_3 e^{ik_1a}$$

$$k_2A_2 e^{k_2a} - k_2B_2 e^{-k_2a} = ik_1B_3 e^{ik_1a}$$




These equations can be solved to derive four of the unknown coefficients in terms
of the fifth. We are interested primarily in the transmission probability, given here
by

$$T = \frac{B_3^*B_3}{A_1^*A_1}$$

After some tedious algebra, we obtain

$$T = \left[1 + \frac{V_0^2}{4E(V_0 - E)}\sinh^2(k_2a)\right]^{-1} \tag{4.27}$$

Clearly, $T$ is nonzero even when $E < V_0$, so a particle can tunnel through the
barrier and emerge on the other side! The classical regime corresponds to the limit
$k_2a \gg 1$; this is why tunneling is never observed in classical systems.
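
To get a feel for the size of the tunneling probability, the standard result for a rectangular barrier, $T = [1 + V_0^2\sinh^2(k_2a)/(4E(V_0 - E))]^{-1}$, can be evaluated numerically. The sketch below is a hedged illustration: the function names, the electron energies, and the barrier widths are assumptions chosen for the example. For very thick barriers $\sinh(k_2a)$ overflows ordinary floating-point numbers, so a second helper returns only the logarithm of the dominant factor $e^{-2k_2a}$.

```python
import math

HBAR = 1.05e-34   # J s
M_E = 9.1e-31     # electron mass in kg
EV = 1.6e-19      # joules per electron volt

def transmission(E, V0, a, m):
    """Tunneling probability through a rectangular barrier of width a, 0 < E < V0."""
    k2 = math.sqrt(2.0 * m * (V0 - E)) / HBAR
    prefactor = V0 ** 2 / (4.0 * E * (V0 - E))
    return 1.0 / (1.0 + prefactor * math.sinh(k2 * a) ** 2)

def log10_transmission_thick(k2a):
    """log10 of e^{-2 k2 a}, the dominant factor in T when k2 a >> 1.

    Working with the logarithm avoids overflow for enormous k2 a.
    """
    return -2.0 * k2a / math.log(10.0)

if __name__ == "__main__":
    # Illustrative case: a 1 eV electron meeting a 2 eV barrier of varying width
    for width_nm in (0.1, 0.5, 1.0, 2.0):
        T = transmission(1.0 * EV, 2.0 * EV, width_nm * 1e-9, M_E)
        print(f"a = {width_nm:3.1f} nm: T = {T:.3e}")
    print("log10(T) for k2 a = 100:", log10_transmission_thick(100.0))
```

The transmission probability drops roughly exponentially with the barrier width, which is why tunneling matters for electrons crossing nanometer-scale barriers but is utterly negligible once $k_2a$ becomes large.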



Example 4.2. A Baseball Tunneling Through a Wall. 

Suppose that a baseball is tossed at a wall; what is the probability that it will tunnel
through to the other side?

The mass of a baseball is 0.14 kg, and we will assume it is tossed gently at
1.0 m/s. It is reasonable to take $E \ll V_0$. We can derive an upper limit on $T$ by
taking $k_2 = \sqrt{2mE}/\hbar$ instead of $k_2 = \sqrt{2m(V_0 - E)}/\hbar$. Then

$$k_2 = p/\hbar = (0.14\ \text{kg})(1.0\ \text{m/s})/(1.05 \times 10^{-34}\ \text{J s}) = 1.3 \times 10^{33}\ \text{m}^{-1}$$

Taking the width of the wall to be $a = 0.2$ m, we get

$$k_2a = 2.6 \times 10^{32}$$


Then 



This is such an enormous number that the factor multiplying sinh 2 (k 2 a) in Equation 
(4.27) is of no consequence. Then Equation (4.27) gives 

T = iO- 10 ” 

This is an almost unimaginably tiny probability: a decimal point followed by 10 32 
zeros. (In fact, if written out as a decimal, the number wouldn’t actually fit inside the 
visible universe.) Needless to say, we do not observe baseballs tunneling through 
walls. 
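The arithmetic in this example can be reproduced in a few lines. The sketch below is a rough check only: it uses the rounded value of $\hbar$ from the example, drops the algebraic prefactor of the barrier formula, and works with $\log_{10} T$ because $e^{-2k_2 a}$ is far too small for floating-point numbers. The function name is a hypothetical choice.

```python
import math

HBAR = 1.05e-34  # J s, the rounded value used in the example


def log10_transmission_bound(mass, speed, width):
    """Return (k2 * a, log10 of the upper bound on T) for a slow, heavy object.

    Uses k2 = p / hbar and T ~ exp(-2 k2 a); the prefactor of the barrier
    formula is dropped because it is negligible next to the exponent.
    """
    k2 = mass * speed / HBAR
    k2a = k2 * width
    # sinh(k2a)**2 would overflow a float, so stay in log space.
    return k2a, -2.0 * k2a / math.log(10.0)


if __name__ == "__main__":
    k2a, log10_t = log10_transmission_bound(mass=0.14, speed=1.0, width=0.2)
    print(f"k2 a = {k2a:.2e}")
    print(f"log10(T) is about {log10_t:.2e}")
```

The exponent comes out of order $10^{32}$, in line with the estimate quoted in the example.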







FIGURE 4.8 The potential experienced by an α particle in a nucleus.

Tunneling is seen, however, on the nuclear scale. An important example is α
decay, in which a heavy nucleus such as $^{238}$U emits an α particle (consisting of
two protons and two neutrons). The potential seen by the α particle consists of
an attractive nuclear force at short distances plus the Coulomb repulsion from the
remainder of the nucleus at large distances (see Figure 4.8). The total energy E
can be measured for the emitted α particle, and it is found to be lower than the
height of the potential barrier. The only way for an α particle to escape, therefore,
is by quantum-mechanical tunneling. As a consequence, the typical lifetimes of
α-emitting nuclei are enormous. For example, $^{238}$U has a lifetime of
$\tau = 6.5 \times 10^{9}$ years: half the age of the universe!

Another application of tunneling is seen in the tunnel diode. This is a solid-state 
device in which, over a certain range in applied voltage, electrons tunnel through 
a potential barrier. This tunneling current increases with applied voltage up to a 
maximum value and then decreases with voltage. This results in an interesting 
property for the tunnel diode: over some range in applied voltage, the current 
decreases as the voltage increases. Thus, over this range in voltage, the resistance 
is negative! 


4.2 ■ BOUND SYSTEMS 

We now move from unbound systems to bound systems. We will consider two 
important examples of bound systems in one dimension: the infinite square-well 
potential and the harmonic oscillator. The infinite square well has less physical
importance than the harmonic oscillator, but it is simpler to solve and will illustrate
concepts that can be applied elsewhere. Both of these one-dimensional problems
will serve as a “warm-up” for the more physically relevant (but also more
complicated) three-dimensional systems examined in Chapter 6.

FIGURE 4.9 The infinite square-well potential.


The Infinite Square Well 

Consider the one-dimensional infinite square-well potential of width a, shown in
Figure 4.9. This potential has V(x) = 0 for 0 < x < a, and V(x) = ∞ for x < 0
and x > a. Of course, no physical potential can be truly infinite, but this potential
will be a good approximation for any system with sharp potential barriers such
that $V \gg E$.

For our idealized system, it is clear that a particle with any energy E will be in
a bound state. The Schrödinger equation for 0 < x < a can be written

$$\frac{d^2\psi}{dx^2} + \frac{2mE}{\hbar^2}\psi = 0 \tag{4.28}$$

Since E > 0, the general solution will have the form of either Equation (4.4)
or Equation (4.5), which are completely equivalent. It will be more convenient,
however, to use the trigonometric form of the solution (Equation 4.5), which gives,
as the general solution,

$$\psi(x) = C_1 \cos\left(\sqrt{2mE/\hbar^2}\,x\right) + C_2 \sin\left(\sqrt{2mE/\hbar^2}\,x\right)$$

The wave function must vanish where the potential is infinite and must be continuous,
so the boundary conditions in the problem are ψ(0) = 0 and ψ(a) = 0. The
first of these gives

$$\psi(0) = C_1 \cos(0) + C_2 \sin(0) = C_1 = 0$$







Hence, $C_1 = 0$, and the wave function is simply

$$\psi(x) = C_2 \sin\left(\sqrt{2mE/\hbar^2}\,x\right) \tag{4.29}$$

The second boundary condition, ψ(a) = 0, implies

$$\psi(a) = C_2 \sin\left(\sqrt{2mE/\hbar^2}\,a\right) = 0$$

We cannot take $C_2 = 0$ (or the wave function would vanish everywhere!), so we
must assume that

$$\sin\left(\sqrt{2mE/\hbar^2}\,a\right) = 0 \tag{4.30}$$

The function sin(x) is zero for x = 0, π, 2π, ..., nπ. Hence, Equation (4.30) can
only be satisfied if the argument of the sine function is an integer multiple of π:

$$\sqrt{2mE/\hbar^2}\,a = n\pi, \qquad n = 1, 2, 3, \ldots \tag{4.31}$$


Note that we exclude the case n = 0, which would also produce a wave function that
vanishes everywhere. The only free parameter in Equation (4.31) is the energy E.
This equation tells us that the Schrödinger equation has a solution only for certain
discrete values of E which satisfy Equation (4.31). Solving Equation (4.31) for E,
we get

$$E_n = \frac{\hbar^2 \pi^2 n^2}{2ma^2}, \qquad n = 1, 2, 3, \ldots \tag{4.32}$$

where we have labeled the energy with a subscript corresponding to the value of
n on the right-hand side.

It is clear that the energy of this system is quantized. The energy E cannot have
an arbitrary value as in classical physics. Instead, only the set of discrete values
given by Equation (4.32) is allowed. Although we showed qualitatively in the
previous chapter that the Schrödinger equation for bound systems leads to energy
quantization, this is our first explicit calculation with the Schrödinger equation that
demonstrates such quantization.

Equation (4.32) has an interesting corollary. Since the smallest allowed value
for n is n = 1, the smallest possible energy for the particle is

$$E_1 = \frac{\hbar^2 \pi^2}{2ma^2} \tag{4.33}$$

Hence, the particle cannot have zero energy; it must have at least the minimum
energy specified by Equation (4.33). This energy is called the zero-point energy of
the system.
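As a numerical illustration of the quantized levels $E_n = \hbar^2\pi^2 n^2/(2ma^2)$, the short Python sketch below evaluates the first few energies. The choice of an electron in a well 1 nm wide is an assumption made for this example, as is the function name `square_well_energy`.

```python
import math

HBAR = 1.054571817e-34      # reduced Planck constant, J s
M_ELECTRON = 9.1093837e-31  # electron mass, kg
EV = 1.602176634e-19        # one electron volt, in joules


def square_well_energy(n, width, mass):
    """Energy (in joules) of level n = 1, 2, 3, ... in an infinite square well."""
    if n < 1:
        raise ValueError("n must be a positive integer; n = 0 is not allowed")
    return (HBAR * math.pi * n) ** 2 / (2.0 * mass * width**2)


if __name__ == "__main__":
    width = 1e-9  # hypothetical well width: 1 nm
    for n in (1, 2, 3):
        energy_ev = square_well_energy(n, width, M_ELECTRON) / EV
        print(f"E_{n} = {energy_ev:.3f} eV")
```

The levels grow as $n^2$, and the lowest one, the zero-point energy, is a fraction of an electron volt for these values.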








The wave function ψ(x) in Equation (4.29) can now be simplified by substituting
Equation (4.32) for the energy E. This gives

$$\psi_n = C_2 \sin\left(\frac{n\pi x}{a}\right)$$

where $\psi_n$ is the form of the wave function corresponding to a particle with energy
$E_n$. The constant $C_2$ is calculated by normalizing the wave function:

$$\int_0^a \psi_n(x)^* \psi_n(x)\,dx = |C_2|^2 \int_0^a \sin^2\left(\frac{n\pi x}{a}\right) dx = |C_2|^2\,\frac{a}{2} = 1$$

Taking, for convenience, $C_2$ to be real and positive, we get

$$C_2 = \sqrt{\frac{2}{a}}$$

Note that for this case, the coefficient $C_2$ does not depend on n. This will not
necessarily be the case for solutions with other potentials. The normalized wave
functions and corresponding energies are then

$$\psi_n(x) = \sqrt{\frac{2}{a}} \sin\left(\frac{n\pi x}{a}\right), \qquad E_n = \frac{\hbar^2 \pi^2 n^2}{2ma^2}$$

The first few wave functions (for n = 1, 2, 3) are shown in Figure 4.10. Note the
resemblance to standing waves in a pipe. The wave functions $\psi_n(x)$ are alternately
symmetric (for odd n) and antisymmetric (for even n) about x = a/2. In fact, it is
possible to show (see Exercise 4.10) that for any symmetric potential, the solutions
of the Schrödinger equation will be either symmetric or antisymmetric. Note also
that $\psi_n$ crosses the x-axis n − 1 times.

The probability of finding the particle in a small interval dx at the location x
is just $\psi(x)^*\psi(x)\,dx$. The behavior of $\psi(x)^*\psi(x)$ is shown in Figure 4.11 for
n = 1, 2, 3, and 50. As n → ∞, we expect to approach the classical regime. It
is clear from Figure 4.11 that quantum mechanics predicts, in the limit of large
n, a uniform probability of finding the particle anywhere inside the square well.
This is exactly what would be expected for the classical case of a particle
bouncing continuously between the two walls of a closed container; the particle
spends an equal amount of time at every point in the container.
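The approach to the classical limit can be checked numerically. The sketch below assumes the normalized wave functions $\psi_n(x) = \sqrt{2/a}\,\sin(n\pi x/a)$ and integrates $\psi_n^2$ over the middle third of the well with a simple midpoint rule. A classical particle would be found there one third of the time. The function names and the choice of the middle third are illustrative.

```python
import math


def psi(n, x, a=1.0):
    """Normalized infinite-square-well wave function on 0 <= x <= a."""
    return math.sqrt(2.0 / a) * math.sin(n * math.pi * x / a)


def probability(n, x_lo, x_hi, a=1.0, steps=20000):
    """Midpoint-rule integral of psi_n(x)**2 from x_lo to x_hi."""
    dx = (x_hi - x_lo) / steps
    total = 0.0
    for i in range(steps):
        x = x_lo + (i + 0.5) * dx
        total += psi(n, x, a) ** 2
    return total * dx


if __name__ == "__main__":
    for n in (1, 2, 3, 50):
        whole = probability(n, 0.0, 1.0)
        middle = probability(n, 1.0 / 3.0, 2.0 / 3.0)
        print(f"n = {n:2d}: total = {whole:.4f}, middle third = {middle:.4f}")
```

For n = 1 the particle is found in the middle third well over half the time, while for n = 50 the probability is close to the classical value of 1/3.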





FIGURE 4.10 The infinite square-well wave functions, $\psi_n(x)$, for n = 1, 2, and 3.

FIGURE 4.11 $\psi_n(x)^*\psi_n(x)$ for the infinite square well, for n = 1, 2, 3, and 50.

A physical system of this sort can be constructed by sandwiching a thin layer of 
semiconductor between thicker layers of a different semiconductor (an example 
of a semiconductor heterostructure mentioned at the beginning of this chapter). 
The electrons are then free to move along the thin layer, but are confined in a 
potential well perpendicular to this layer. When such a structure is extended to 
three dimensions (so that the electrons are effectively confined to a single point), 
it produces a quantum dot.

The Harmonic Oscillator Potential 

We now examine a potential with more physical significance: the harmonic
oscillator potential. The one-dimensional harmonic oscillator potential is

$$V(x) = \frac{1}{2} K x^2 \tag{4.34}$$


This potential is familiar from classical mechanics, where it represents the potential 
energy of a mass attached to an ideal spring with spring constant K. Before solving 
the quantum harmonic oscillator, we first review the behavior of the classical 
harmonic oscillator. For a classical spring, the force is given by 


$$F(x) = -Kx$$

so the equation of motion for the mass is

$$m \frac{d^2 x}{dt^2} = -Kx$$

One example of a solution to this equation of motion is

$$x = A \cos(\omega t)$$

for the position of the mass, and

$$v = -A\omega \sin(\omega t)$$

for the velocity of the mass, where $\omega = \sqrt{K/m}$. Thus, the motion of the mass is
sinusoidal. This solution gives a constant value for the total energy:

$$E = \frac{1}{2} m v^2 + \frac{1}{2} K x^2 = \frac{1}{2} m A^2 \omega^2 \sin^2(\omega t) + \frac{1}{2} K A^2 \cos^2(\omega t) = \frac{1}{2} K A^2 \tag{4.35}$$

Before proceeding to solve the Schrodinger equation with this potential, it is 
reasonable to ask why this potential would be of any interest at all. We certainly 
do not expect, for example, to see particles attached to atomic-scale springs! The 
answer lies in the fact that this potential is an excellent approximation to the motion 
of a particle undergoing small oscillations about the minimum of any potential. 
Consider an arbitrary potential V (x), and choose the origin of the x-axis to lie at 
the minimum of this potential. Now consider a particle trapped near the minimum 
of the potential at x = 0. The potential can be approximated as a Taylor series near 
the origin: 

$$V(x) = V(0) + V'(0)x + \frac{1}{2} V''(0) x^2 + \frac{1}{6} V'''(0) x^3 + \cdots$$

The first term in this equation is just a constant and can be ignored, since we
are always free to redefine the zero of a potential. The minimum of the potential
lies at x = 0, so V'(0) = 0, and the second term is zero. This leaves only terms
proportional to $x^2$, $x^3$, $x^4$, and higher powers of x. But if x is sufficiently small, then
$|x^2| \gg |x^3| \gg |x^4| \cdots$, and we need worry only about the $x^2$ term. Thus, a particle
undergoing small oscillations about the minimum of the potential will experience
the approximate potential

$$V(x) = \frac{1}{2} V''(0) x^2$$

which has the same form as Equation (4.34) with K = V''(0). For example, the
motion of ions in a crystal lattice is often approximated using a harmonic oscillator
potential.
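The quality of the harmonic approximation near a minimum is easy to test numerically. The sketch below uses a Morse potential, $V(x) = D(1 - e^{-\alpha x})^2$, purely as an illustrative stand-in for "an arbitrary potential" with its minimum at x = 0. The Morse form, the parameter values, and the function names are assumptions for this example and do not come from the text. The effective spring constant K = V''(0) is estimated with a central difference.

```python
import math


def second_derivative(f, x0, h=1e-4):
    """Central-difference estimate of f''(x0)."""
    return (f(x0 + h) - 2.0 * f(x0) + f(x0 - h)) / h**2


def morse(x, depth=1.0, alpha=2.0):
    """Hypothetical Morse potential with its minimum at x = 0."""
    return depth * (1.0 - math.exp(-alpha * x)) ** 2


if __name__ == "__main__":
    spring_constant = second_derivative(morse, 0.0)  # K = V''(0)
    print(f"K = {spring_constant:.4f}")
    for x in (0.01, 0.05, 0.2):
        exact = morse(x)
        harmonic = 0.5 * spring_constant * x**2
        print(f"x = {x}: V = {exact:.6f}, (1/2) K x^2 = {harmonic:.6f}")
```

For this potential V''(0) = 2Dα², so the estimate can be compared with the exact value. The two columns agree closely for small x and drift apart as x grows, which is the sense in which the approximation holds only for small oscillations.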

We now proceed to solve the quantum harmonic oscillator. The Schrödinger
equation with the one-dimensional harmonic oscillator potential is

$$-\frac{\hbar^2}{2m} \frac{d^2\psi}{dx^2} + \frac{1}{2} K x^2 \psi = E\psi \tag{4.36}$$

In order to simplify the algebra involved in solving this equation, it is convenient
to define a new independent variable s given by

$$s = \frac{(Km)^{1/4}}{\hbar^{1/2}}\,x$$

and a new constant λ proportional to the energy:

$$\lambda = \frac{2E}{\hbar} \sqrt{\frac{m}{K}} \tag{4.37}$$

Both s and λ are dimensionless; otherwise they have no special significance beyond
simplifying the calculation.

In terms of s and λ, Equation (4.36) simplifies to

$$\frac{d^2\psi}{ds^2} + (\lambda - s^2)\psi = 0 \tag{4.38}$$


To derive a solution, consider what happens for $s^2 \gg \lambda$. In this limit, Equation
(4.38) looks like

$$\frac{d^2\psi}{ds^2} - s^2\psi = 0 \tag{4.39}$$

Even this simplified version of the equation cannot be solved exactly in terms of
elementary functions, but we can find an approximate solution valid for large s:

$$\psi = A e^{s^2/2} + B e^{-s^2/2}$$

For this solution,

$$\frac{d^2\psi}{ds^2} = A(1 + s^2) e^{s^2/2} - B(1 - s^2) e^{-s^2/2}$$

and for $s \gg 1$

$$\frac{d^2\psi}{ds^2} \approx A s^2 e^{s^2/2} + B s^2 e^{-s^2/2} = s^2 \psi$$

satisfying Equation (4.39). However, we also want our solution to be well behaved
in the limit where s → ±∞. This means that A = 0 (otherwise, the solution “blows
up” at ±∞). Thus, we expect the solution to resemble, in the limit of large $s^2$,

$$\psi \sim e^{-s^2/2} \tag{4.40}$$

This is not, in general, an exact solution of Equation (4.38), but it suggests that we
look for an exact solution of the form

$$\psi(s) = f(s)\,e^{-s^2/2} \tag{4.41}$$

Substituting this form for ψ into Equation (4.38) and simplifying, we get a
differential equation for f(s):

$$\frac{d^2 f}{ds^2} - 2s \frac{df}{ds} + (\lambda - 1) f = 0 \tag{4.42}$$


To find a function f(s) which satisfies this equation, we expand f(s) in a
power series:

$$f(s) = \sum_{n=0}^{\infty} a_n s^n \tag{4.43}$$

where the coefficients $a_n$ must be chosen to satisfy Equation (4.42). Substituting
this form for f(s) into Equation (4.42) gives

$$\sum_{n=2}^{\infty} n(n-1) a_n s^{n-2} - 2\sum_{n=0}^{\infty} n a_n s^n + (\lambda - 1) \sum_{n=0}^{\infty} a_n s^n = 0$$

Rewriting the first term in this equation in terms of m = n − 2 and combining the
last two terms gives

$$\sum_{m=0}^{\infty} (m+2)(m+1) a_{m+2} s^m + \sum_{n=0}^{\infty} (\lambda - 2n - 1) a_n s^n = 0$$

(Note that m is just a summation label, so that it can be changed back to n in the
first term of this equation.) In order for this equation to be satisfied, the left-hand
side must be identically zero. This can only be achieved if the factor multiplying
every power of s is zero. This requirement gives

$$(n+2)(n+1) a_{n+2} + (\lambda - 2n - 1) a_n = 0$$

which can be used to fix $a_{n+2}$ in terms of $a_n$:

$$a_{n+2} = \frac{2n + 1 - \lambda}{(n+2)(n+1)}\,a_n \tag{4.44}$$


A relationship of this sort is called a recursion relation. Given $a_0$ and $a_1$,
Equation (4.44) can be iterated to calculate all of the other terms in the power series.
However, an arbitrary choice for $a_0$ and $a_1$ will not, in general, give an acceptable
solution. The reason is that the solution should look like Equation (4.40) when s is
large. If the factor f(s) multiplying the exponential in Equation (4.41) is a finite
polynomial, then the exponential factor will dominate the polynomial at large s,
giving the desired asymptotic behavior for ψ(s) at large s (Equation 4.40). An
infinite power series, on the other hand, will dominate the exponential at large s,
leading to an unacceptable solution.

We therefore require that the power series must terminate at some finite value of
n. This can be achieved by choosing an appropriate value of λ in Equation (4.44).
Recall that λ is simply proportional to the energy E, and we expect the energy to
be quantized, so that the Schrödinger equation will have solutions for only a set
of discrete values of E. Thus, fixing the value of λ in Equation (4.44) is simply
equivalent to fixing the value of E to give a solution for the Schrödinger equation.
Consider what happens if we set

$$\lambda = 2n + 1 \tag{4.45}$$

for some fixed value of n. As an example, suppose that we choose λ = 13, so that
λ = 2n + 1 for n = 6. Then we fix a value for $a_0$, which determines $a_2$ through
Equation (4.44). Then $a_2$ determines $a_4$, and $a_4$ determines $a_6$, at which point
Equation (4.44) with n = 6 and λ = 13 gives $a_8 = 0$. Then $a_{10} = 0$, $a_{12} = 0$, and
so on. However, this still leaves the odd values of n: $a_1$, $a_3$, .... There is nothing
to force this sequence of terms to terminate at a finite value of n (since we chose n
to be even in Equation 4.45). Therefore, to obtain a finite polynomial in Equation
(4.43), we must choose $a_1 = 0$, so that all of the odd-numbered terms are zero.
Conversely, if we take n in Equation (4.45) to be odd, then $a_0$ and all of the other
even-numbered terms must vanish.
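The termination of the series can be seen by iterating the recursion relation directly. The sketch below assumes the recursion $a_{k+2} = (2k + 1 - \lambda)\,a_k / [(k+2)(k+1)]$ with $\lambda = 2n + 1$, and uses exact rational arithmetic. The function names and the seed values are illustrative; the seeds are chosen only so that the resulting polynomials have integer coefficients.

```python
from fractions import Fraction


def next_coefficient(k, lam, a_k):
    """One step of the recursion: return a_{k+2} given a_k."""
    return Fraction(2 * k + 1 - lam, (k + 2) * (k + 1)) * a_k


def oscillator_polynomial(n, seed=1):
    """Coefficients a_0..a_n of f(s) for lambda = 2n + 1.

    For even n the seed is a_0 and every odd coefficient is set to zero;
    for odd n the seed is a_1 and every even coefficient is set to zero.
    """
    lam = 2 * n + 1
    coeffs = [Fraction(0)] * (n + 1)
    start = n % 2
    coeffs[start] = Fraction(seed)
    for k in range(start, n - 1, 2):
        coeffs[k + 2] = next_coefficient(k, lam, coeffs[k])
    return coeffs


if __name__ == "__main__":
    for n, seed in ((2, -2), (3, -12), (4, 12)):
        coeffs = oscillator_polynomial(n, seed)
        print(f"n = {n}:", [str(c) for c in coeffs])
    # With lambda = 13 (n = 6), the coefficient following a_6 vanishes.
    print("a_8 / a_6 =", next_coefficient(6, 13, Fraction(1)))
```

With these seeds, the coefficient lists (lowest power first) can be compared with the polynomials multiplying the Gaussian in the n = 2, 3, and 4 solutions.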

This procedure produces a set of solutions $\psi_n(s)$, corresponding to our choice
of λ in Equation (4.45), and λ, in turn, gives the corresponding energies $E_n$ in
Equation (4.37). Here is what the first few solutions look like:

$$n = 0, \qquad \psi_0(s) = C_0\,e^{-s^2/2} \tag{4.46}$$

$$n = 1, \qquad \psi_1(s) = C_1 (2s)\,e^{-s^2/2} \tag{4.47}$$

$$n = 2, \qquad \psi_2(s) = C_2 (4s^2 - 2)\,e^{-s^2/2} \tag{4.48}$$

$$n = 3, \qquad \psi_3(s) = C_3 (8s^3 - 12s)\,e^{-s^2/2} \tag{4.49}$$

$$n = 4, \qquad \psi_4(s) = C_4 (16s^4 - 48s^2 + 12)\,e^{-s^2/2} \tag{4.50}$$




where the constants $C_n$ must be fixed to normalize the wave functions. The
normalization condition $\int \psi_n^* \psi_n\,dx = 1$ gives

$$C_n = \frac{(Km)^{1/8}}{\pi^{1/4}\,\hbar^{1/4}\,\sqrt{2^n\,n!}}$$

FIGURE 4.12 The harmonic oscillator wave functions for the lowest four energy
states: n = 0, 1, 2, 3, where $s = [(Km)^{1/4}/\hbar^{1/2}]\,x$.



The polynomials appearing in Equations (4.46)-(4.50) are called Hermite
polynomials; they crop up in other areas of physics and mathematics. The harmonic
oscillator wave functions for n = 0, 1, 2, 3 are shown in Figure 4.12. As expected
(Exercise 4.10), these wave functions are alternately even and odd, since the
potential is symmetric about x = 0.
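A quick numerical check confirms that these functions solve the oscillator equation. The sketch below assumes the dimensionless form $d^2\psi/ds^2 + (\lambda - s^2)\psi = 0$ with $\lambda = 2n + 1$, sets every $C_n$ to 1, and estimates the second derivative with a central difference. The table of polynomial coefficients is transcribed from Equations (4.46)-(4.50), and the function names are illustrative.

```python
import math

# Polynomial factors of Equations (4.46)-(4.50), lowest power first.
POLYNOMIALS = {
    0: [1],
    1: [0, 2],
    2: [-2, 0, 4],
    3: [0, -12, 0, 8],
    4: [12, 0, -48, 0, 16],
}


def psi(n, s):
    """Unnormalized oscillator wave function (C_n = 1)."""
    poly = sum(c * s**k for k, c in enumerate(POLYNOMIALS[n]))
    return poly * math.exp(-s * s / 2.0)


def residual(n, s, h=1e-4):
    """Value of psi'' + (lambda - s^2) psi with lambda = 2n + 1."""
    second = (psi(n, s + h) - 2.0 * psi(n, s) + psi(n, s - h)) / h**2
    return second + (2 * n + 1 - s * s) * psi(n, s)


if __name__ == "__main__":
    sample_points = (-2.0, -0.5, 0.3, 1.0, 2.5)
    for n in POLYNOMIALS:
        worst = max(abs(residual(n, s)) for s in sample_points)
        print(f"n = {n}: largest residual = {worst:.1e}")
```

The residuals are zero to within the accuracy of the finite-difference estimate, and evaluating `psi(n, s)` at ±s shows the alternating even and odd symmetry.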





The energies corresponding to these wave functions can be derived from
Equations (4.37) and (4.45):

$$2n + 1 = \lambda = \frac{2E}{\hbar} \sqrt{\frac{m}{K}}$$

so

$$E_n = \left(n + \frac{1}{2}\right) \hbar \sqrt{\frac{K}{m}}$$

Recall that the angular frequency of a classical harmonic oscillator is $\omega = \sqrt{K/m}$,
so that the energies can be written

$$E_n = \left(n + \frac{1}{2}\right) \hbar\omega \tag{4.51}$$

Once again, we see the phenomenon of a zero-point energy: the lowest-energy
state, n = 0, does not have zero energy. Instead, its energy is

$$E_0 = \frac{1}{2} \hbar\omega$$

Note further that the energy levels for the harmonic oscillator are evenly spaced;
adjacent energy levels differ by $\hbar\omega$.

The behavior of $\psi^*\psi$ is shown in Figure 4.13 for the states n = 0, 1, 2, 3. It
is clear that for these wave functions $\psi^*\psi$ is nonzero in the classically forbidden
region V > E (in fact, it is nonzero for all of the harmonic oscillator wave functions
over all space). Once again, we see that the particle can penetrate into a region
which, classically, it should not be able to reach.

These quantum probabilities can be compared with the expected result in the
classical limit. Consider a classical harmonic oscillator with total energy given by
Equation (4.35). The oscillating mass moves between the limits $x_- = -\sqrt{2E/K}$
and $x_+ = \sqrt{2E/K}$ with its largest velocity at the center of the oscillator and the
smallest velocity near $x_-$ and $x_+$. Thus, the mass spends more time near $x_-$ and
$x_+$ and less time near the middle of the oscillator, so if we take a random snapshot
in time, we are most likely to find the particle near $x_-$ and $x_+$, and least likely to
find it in the middle.

More quantitatively, if we pick a random time, the probability P of finding the
mass in a small interval dx is proportional to the time dt that it spends in that
interval. But dt = dx/v, and v can be determined from Equation (4.35):

$$v = \sqrt{(2E - Kx^2)/m}$$

Thus, we have

$$P \propto \sqrt{\frac{m}{2E - Kx^2}}\,dx$$
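The "random snapshot" argument can be tested by sampling the classical motion at evenly spaced phases, which is equivalent to choosing times uniformly over one period. In the sketch below, the amplitude is set to A = 1 (so that 2E = K), and the normalized density $1/(\pi\sqrt{A^2 - x^2})$ follows from the proportionality above once 2E = KA² is substituted. The function names and the sample intervals are illustrative.

```python
import math


def classical_fraction(x_lo, x_hi, amplitude=1.0, samples=100000):
    """Fraction of one period that x = A cos(theta) spends in [x_lo, x_hi].

    Sampling the phase theta uniformly is the same as taking snapshots at
    uniformly distributed times.
    """
    hits = 0
    for i in range(samples):
        theta = 2.0 * math.pi * (i + 0.5) / samples
        x = amplitude * math.cos(theta)
        if x_lo <= x <= x_hi:
            hits += 1
    return hits / samples


def classical_density(x, amplitude=1.0):
    """Normalized classical density 1 / (pi sqrt(A^2 - x^2)) for |x| < A."""
    return 1.0 / (math.pi * math.sqrt(amplitude**2 - x**2))


if __name__ == "__main__":
    print("central half, |x| < A/2:", classical_fraction(-0.5, 0.5))
    print("outer quarter, x > A/2: ", classical_fraction(0.5, 1.0))
    print("density at x = 0:   ", classical_density(0.0))
    print("density at x = 0.9A:", classical_density(0.9))
```

The mass spends as much time in each outer quarter of its range as in the entire central half, which is the quantitative content of the statement that it is most likely to be found near the turning points.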



FIGURE 4.13 $\psi^*\psi$ for the harmonic oscillator, n = 0, 1, 2, 3.


This classical probability, suitably normalized, is shown in Figure 4.14. This
distribution bears little resemblance to the low-n quantum-mechanical probabilities
in Figure 4.13. However, if we take a large value for n, corresponding to the
classical limit, we obtain a result which begins to resemble the classical probability.
In Figure 4.14, we show $\psi^*\psi$ for n = 50. It is clear that in this limit, the
quantum-mechanical probability approaches the classical result.

One application of the quantum harmonic oscillator is in the physics of
diatomic molecules. The two nuclei in a diatomic molecule can vibrate about their
equilibrium separation, and the potential for these nuclei can be approximated by
a harmonic oscillator potential. It is found that the vibrational energy levels are
well described by Equation (4.51). (See also Exercise 4.13.)



FIGURE 4.14 The classical probability P of finding the harmonic oscillator mass in a
small interval dx at a position x is shown as a solid curve. The quantum probability
density $\psi^*\psi$ for n = 50 is shown as a dashed curve.


EXERCISES 


4.1 Show that the differential equation solution given in Equation (4.5),
$y = C_1 e^{i\sqrt{\lambda}x} + C_2 e^{-i\sqrt{\lambda}x}$, is completely equivalent to the solution in
Equation (4.6), $y = D_1 \cos(\sqrt{\lambda}x) + D_2 \sin(\sqrt{\lambda}x)$, and express $C_1$ and $C_2$ in
terms of $D_1$ and $D_2$. If both $D_1$ and $D_2$ are real and nonzero, is it possible for both
$C_1$ and $C_2$ to be real?

4.2 Show that the general expression for the wave function for a free particle, given
by Equation (4.8) as $\Psi(x, t) = C_1 e^{i(kx - \omega t)} + C_2 e^{i(-kx - \omega t)}$, is not an eigenfunction
of momentum unless $C_1 = 0$ or $C_2 = 0$.

4.3 A particle with mass m and energy E is moving in one dimension from right to left. It
is incident on the step potential V(x) = 0 for x < 0 and $V(x) = V_0$ for x > 0, where
$V_0 > 0$, as shown on the diagram. The energy of the particle is $E > V_0$.

[Diagram: step potential V(x), with V = 0 for x < 0 and V = $V_0$ for x > 0.]

(a) Solve the Schrödinger equation to derive ψ(x) for x < 0 and x > 0. Express the
solution in terms of a single unknown constant.

(b) Calculate the value of the reflection coefficient R for the particle.





4.4 A particle with mass m and energy E is moving in one dimension from left to right. It
is incident on the step potential V(x) = 0 for x < 0 and $V(x) = V_0$ for x > 0, where
$V_0 > 0$, as shown on the diagram. The energy of the particle is exactly equal to $V_0$,
i.e., $E = V_0$.

[Diagram: step potential V(x), with V = 0 for x < 0 and V = $V_0$ for x > 0.]

(a) Solve the Schrödinger equation to derive ψ(x) for x < 0 and x > 0. Express the
solution in terms of a single unknown constant.

(b) Calculate the value of the reflection coefficient R for the particle.

4.5 Consider reflection from a step potential of height $V_0$ with $E > V_0$, but now with an
infinitely high wall added at a distance a from the step (see diagram):

[Diagram: step potential of height $V_0$ at x = 0, with an infinitely high wall at x = a.]

(a) Solve the Schrödinger equation to find ψ(x) for x < 0 and 0 < x < a. Your
solution should contain only one unknown constant.

(b) Show that the reflection coefficient at x = 0 is R = 1. This is different from the
value of R previously derived without the infinite wall. What is the physical
reason that R = 1 in this case?

(c) Which part of the wave function represents a leftward-moving particle at x < 0?
Show that this part of the wave function is an eigenfunction of the momentum
operator, and calculate the eigenvalue. Is the total wave function for x < 0 an
eigenfunction of the momentum operator?

4.6 An electron is accelerated through a potential difference of 3.0 V and is incident on
a finite potential barrier of height 5.0 eV and thickness $5.0 \times 10^{-10}$ m. What is the
probability that the electron will tunnel through the barrier?






4.7 Consider an infinite square-well potential of width a, but with the coordinate
system shifted so that the infinite potential barriers lie at x = −a/2 and x = a/2 (see
diagram):

[Diagram: infinite square well V(x) with walls at x = −a/2 and x = a/2.]

(a) Solve the Schrödinger equation for this case to calculate the normalized wave
functions $\psi_n(x)$ and the corresponding energies $E_n$.

(b) Explain why you get the same energies as for the square well between x = 0 and
x = a, but a different set of wave functions.

4.8 A baseball (see Example 4.2) is confined between two thick walls a distance 0.5 m
apart. Calculate the zero-point energy of the baseball.

4.9 A particle is trapped inside an infinite one-dimensional square well of width a in the
first excited state (n = 2).

(a) You make a measurement to locate the particle. At what positions are you most
likely to find the particle? At what positions are you least likely to find it?

(b) Calculate $\langle p^2 \rangle$ for this particle.

4.10 A particle is bound in a one-dimensional potential V(x), where V(x) is symmetric,
i.e., V(x) = V(−x).

(a) Suppose that ψ(x) is a solution of the Schrödinger equation with energy E. Make
the change of variables y = −x, and show that ψ(−x) is also a solution of the
Schrödinger equation with energy E.

(b) Since the solutions of the Schrödinger equation for a fixed value of E are unique
(up to multiplication by a constant), the result from part (a) implies that ψ(x) =
cψ(−x), where c is an unknown constant. Use this result to show that ψ(x) must
be either even [ψ(−x) = ψ(x)] or odd [ψ(−x) = −ψ(x)].

(c) For a particle bound in a one-dimensional symmetric potential so that V(−x) =
V(x), show that all of the following are true:

i. $\psi^*\psi$ is a symmetric function,
ii. $\langle x \rangle = 0$,
iii. $\langle p \rangle = 0$.

4.11 Consider the semi-infinite square well given by $V(x) = -V_0 < 0$ for 0 < x < a and
V(x) = 0 for x > a. There is an infinite barrier at x = 0 (hence the name
“semi-infinite”). A particle with mass m is in a bound state in this potential with energy
E < 0.

(a) Solve the Schrödinger equation to derive ψ(x) for x > 0. Use the appropriate
boundary conditions and normalize the wave function so that the final answer
does not contain any arbitrary constants.

(b) Show that the allowed energy levels E must satisfy the equation

$$\tan\left(\frac{\sqrt{2m(E + V_0)}}{\hbar}\,a\right) = -\sqrt{\frac{-(E + V_0)}{E}}$$

(c) The equation in part (b) cannot be solved analytically to give the allowed energy
levels, but simple solutions exist in certain special cases. Determine the conditions
on $V_0$ and a so that a bound state exists with E = 0.

4.12 A particle of mass m moves in a harmonic oscillator potential. The particle is in the
first excited state.

(a) Calculate $\langle x \rangle$ for this particle.

(b) Calculate $\langle p \rangle$ for this particle.

(c) Calculate $\langle p^2 \rangle$ for this particle.

(d) At what positions are you most likely to find the particle? At what positions are
you least likely to find it?

4.13 The oscillation frequencies of a diatomic molecule are typically $10^{12}$ Hz to $10^{14}$ Hz.
Derive an order of magnitude estimate for the harmonic oscillator constant K for such
molecules.

4.14 A particle of mass m is bound in a one-dimensional power law potential $V(x) =
Kx^{\beta}$, where β is an even positive integer. Show that the allowed energy levels are
proportional to $m^{-\beta/(\beta + 2)}$.

4.15 A particle is moving in a simple harmonic oscillator potential $V(x) = \frac{1}{2}Kx^2$ for x > 0,
but with an infinite potential barrier at x = 0 (the paddle ball potential). Calculate the
allowed wave functions and corresponding energies. Do not worry about normalizing
the wave functions.

4.16 A particle moves in one dimension in the potential $V(x) = V_0 \ln(x/x_0)$ for x > 0,
where $x_0$ and $V_0$ are constants with units of length and energy, respectively. There is
an infinite potential barrier at x = 0. The particle drops from the first excited state
with energy $E_1$ into the ground state with energy $E_0$ by emitting a photon with energy
$E_1 - E_0$. Show that the frequency of the photon emitted by this particle is independent
of the mass of the particle.





CHAPTER

5


Math Interlude B: Linear Algebra


We have already seen that linear operators occupy a central place in quantum
mechanics. In particular, in Chapter 3 we noted that for any observable quantity
o, we can find a linear operator O with the following properties:

1. If a particle is in a state with a definite value of o, then the wave function ψ
for the particle is an eigenfunction of O with eigenvalue o:

$$O\psi = o\psi \tag{5.1}$$

Note that O is an operator and o is a number, but O and o must have the
same physical units (joules, meters, etc.). If Equation (5.1) is satisfied, then
if a measurement of the observable is made, the result is guaranteed to be o.

2. If the particle is not in a state with a definite value of o, then ψ will not be an
eigenfunction of O, and Equation (5.1) will not be satisfied. If a measurement
of the observable is made, there is no way to predict the result. However, the
expectation value of o is still a well-defined quantity given by

$$\langle o \rangle = \int \psi^* O \psi\,d^3r$$

The properties of linear operators are part of a more general branch of
mathematics called linear algebra, and it is this subject which we examine in more detail
here.


5.1 ■ PROPERTIES OF LINEAR OPERATORS 

Recall that an operator O, in order to be a linear operator, must have two properties:

$$O[cf(x)] = cOf(x)$$

and

$$O[f(x) + g(x)] = Of(x) + Og(x)$$

for all functions f(x) and g(x), and all complex numbers c.





We can go further and define addition or subtraction of linear operators. If P 
and Q are two linear operators, then their sum, R = P + Q, is defined by 

Rf(x) = (P + Q)f(x) = Pf(x) + Qf(x) 

A similar result holds for subtraction: if R = P — Q, then 

$$Rf(x) = Pf(x) - Qf(x)$$

In fact, we have already implicitly used this definition when we introduced the
Hamiltonian operator H. The Hamiltonian operator is defined as the sum of two
operators:

$$H = -\frac{\hbar^2}{2m} \nabla^2 + V$$

so that

$$H\psi = -\frac{\hbar^2}{2m} \nabla^2 \psi + V\psi$$

We can also define the product of two linear operators. If P and Q are linear
operators, then R = PQ is defined by

$$R\psi = (PQ)\psi = P(Q\psi) \tag{5.2}$$

where Equation (5.2) means that we first apply the operator Q to ψ, and then we
apply the operator P to the result. The operator PQ is also called the composition
of P and Q.


Example 5.1. Multiplication of Two Operators. 

Let X be the one-dimensional position operator, Xψ = xψ, and let D be the
derivative operator: Dψ = dψ/dx. Calculate DX.

Applying DX to an arbitrary function ψ(x) gives

$$DX\psi = \frac{d}{dx}(x\psi) = \psi + x\frac{d\psi}{dx} = (1 + XD)\psi$$

so that

$$DX = 1 + XD \tag{5.3}$$





Example 5.1 demonstrates that the multiplication of operators differs from the
multiplication of ordinary numbers in one important respect: it is not commutative!
Clearly, Equation (5.3) implies that DX ≠ XD.

In fact, the question of whether or not operators commute is of central
importance in quantum mechanics. Hence, it is customary to define a special quantity
called the commutator. For any two operators A and B, the commutator is denoted
by the symbol [A, B], and it is defined as

$$[A, B] = AB - BA$$

When two operators do commute with each other, their commutator is, obviously,
zero.
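Operators that act on functions map naturally onto higher-order functions, so the commutator of the derivative and position operators can be explored numerically. In the sketch below, D is approximated by a central difference, and the Gaussian test function is an arbitrary choice. The names `D`, `X`, `commutator`, and `psi` are illustrative. The result of Example 5.1, DX = 1 + XD, is equivalent to [D, X]ψ = ψ.

```python
import math


def D(f, h=1e-4):
    """Derivative operator, approximated by a central difference."""
    return lambda x: (f(x + h) - f(x - h)) / (2.0 * h)


def X(f):
    """Position operator: multiply the function by x."""
    return lambda x: x * f(x)


def commutator(P, Q):
    """Return the operator [P, Q] = PQ - QP acting on functions."""
    return lambda f: (lambda x: P(Q(f))(x) - Q(P(f))(x))


def psi(x):
    """Arbitrary smooth test function (a Gaussian, chosen for illustration)."""
    return math.exp(-x * x)


if __name__ == "__main__":
    d_x = commutator(D, X)(psi)
    for x in (-1.0, 0.0, 0.7):
        print(f"x = {x}: [D, X] psi = {d_x(x):.6f}, psi = {psi(x):.6f}")
```

The two printed columns agree to within the finite-difference error, while `commutator(X, X)` returns zero for any function, as any operator commutes with itself.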


Example 5.2. The Commutator of H and P. 

As an example, we calculate the commutator of the one-dimensional Hamiltonian
operator H and the one-dimensional momentum operator P.

Applying [H, P] to an arbitrary wave function, ψ(x), gives

$$\begin{aligned}
[H, P]\psi &= HP\psi - PH\psi \\
&= \left[-\frac{\hbar^2}{2m}\frac{d^2}{dx^2} + V(x)\right]\left[-i\hbar\frac{d\psi}{dx}\right] - \left[-i\hbar\frac{d}{dx}\right]\left[-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + V(x)\psi\right] \\
&= i\frac{\hbar^3}{2m}\frac{d^3\psi}{dx^3} - i\hbar V(x)\frac{d\psi}{dx} - i\frac{\hbar^3}{2m}\frac{d^3\psi}{dx^3} + i\hbar\psi\frac{dV}{dx} + i\hbar V(x)\frac{d\psi}{dx} \\
&= i\hbar\frac{dV}{dx}\psi
\end{aligned}$$

Hence,

$$[H, P] = i\hbar\frac{dV}{dx}$$

Some important general properties of commutators are (for any operators A,
B, and C)

$$[A, B] = -[B, A] \tag{5.4}$$

$$[A, A] = 0 \tag{5.5}$$

$$[A + B, C] = [A, C] + [B, C] \tag{5.6}$$

$$[A, BC] = [A, B]C + B[A, C] \tag{5.7}$$

(See Exercise 5.1.)

The importance of commutators for quantum mechanics arises in the following
way. Suppose that I have a particle for which I want to measure two different
observables, a and b. (For instance, I might want to measure the position and
momentum of the particle.) Is it possible for the particle to be in a state of definite
a and definite b at the same time? The answer is “yes,” but only if the corresponding
operators A and B commute.

To see why this is the case, recall that in order for the particle to be in a state of
definite a, its wave function $\psi$ must be an eigenfunction of A with eigenvalue a:

$$A\psi = a\psi \tag{5.8}$$

Now we make an additional assumption: there are no other wave functions (other
than multiples of $\psi$) that are eigenfunctions of A with the same eigenvalue a. (If
two different eigenfunctions have the same eigenvalue, and one eigenfunction is
not a multiple of the other, then the eigenfunctions are said to be degenerate. The
argument presented here can be extended to the case of degenerate states, but it
is somewhat more complicated.) Now suppose that A and B do commute so that
[A, B] = 0. Operating on both sides of Equation (5.8) with the operator B gives

$$BA\psi = Ba\psi = aB\psi$$

But A commutes with B, so we can rewrite the left-hand side of this equation to
get

$$A(B\psi) = a(B\psi)$$

So we have shown that $B\psi$ is an eigenfunction of A with eigenvalue a. However,
we assumed that the eigenfunctions of A were all nondegenerate, so that any
eigenfunction of A with eigenvalue a must simply be a multiple of $\psi$. Thus, $B\psi$
must be a multiple of $\psi$, e.g., $b\psi$, and

$$B\psi = b\psi$$

So $\psi$ is simultaneously an eigenfunction of both A and B. Physically, this means
that the particle is in a state of definite a and b, and both quantities can be measured
simultaneously.
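The argument above can be checked on a small finite-dimensional stand-in. The sketch below is only an illustration: the 2x2 matrices `A` and `B`, the vector `psi`, and the helper names are assumptions chosen for the example, not operators from the text. It confirms that when [A, B] = 0 and the eigenvalue of A is nondegenerate, the eigenvector of A is also an eigenvector of B.

```python
# Sketch: 2x2 matrices stand in for the operators A and B (illustrative assumption).

def matmul(m, n):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(m[i][k] * n[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def commutator(m, n):
    """[M, N] = MN - NM."""
    mn, nm = matmul(m, n), matmul(n, m)
    return [[mn[i][j] - nm[i][j] for j in range(2)] for i in range(2)]

def apply(m, v):
    """Apply a 2x2 matrix to a 2-component vector."""
    return [m[0][0] * v[0] + m[0][1] * v[1], m[1][0] * v[0] + m[1][1] * v[1]]

A = [[2, 1], [1, 2]]   # eigenvalues 3 and 1, so no degeneracy
B = [[0, 1], [1, 0]]   # chosen so that it commutes with A
psi = [1, 1]           # eigenvector of A with eigenvalue a = 3

comm_AB = commutator(A, B)   # the zero matrix
A_psi = apply(A, psi)        # equals 3 * psi
B_psi = apply(B, psi)        # equals 1 * psi, so b = 1
```

Here `psi` plays the role of a state with definite a = 3 and definite b = 1.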


Example 5.3. Simultaneous Eigenfunctions. 

For which potentials V(x) is it possible to find solutions of the one-dimensional
time-independent Schrödinger equation which are also states of definite momentum?

We calculated [H, P] in Example 5.2, finding

$$[H, P] = i\hbar\frac{\partial V}{\partial x}$$

Solutions of the one-dimensional time-independent Schrödinger equation are
eigenfunctions of H; in order for them also to be eigenfunctions of P, we must
have [H, P] = 0, which implies V = constant (and the constant can be set to zero).
Hence, free particles are the only solutions of the time-independent Schrödinger
equation that can be in states of definite momentum; it is precisely these states
which we examined in the previous chapter.


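Another very famous commutator is provided by the momentum and position
operators. Consider the one-dimensional case, [P, X]:

$$[P, X]\psi = -i\hbar\frac{\partial}{\partial x}(x\psi) - x\left(-i\hbar\frac{\partial\psi}{\partial x}\right)$$

$$= -i\hbar\psi - i\hbar x\frac{\partial\psi}{\partial x} + i\hbar x\frac{\partial\psi}{\partial x}$$

$$= -i\hbar\psi$$

which implies

$$[P, X] = -i\hbar$$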

Thus, a particle can never be in a state which is simultaneously a state of definite 
momentum and a state of definite position. If the particle is at a definite position, 
its momentum cannot be measured exactly, and vice-versa. 
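As a numerical sanity check on this commutator, the sketch below applies P = -iħ d/dx as a central finite difference and compares [P, X]ψ with -iħψ at one point. The Gaussian test function, the choice ħ = 1, and the function names are assumptions made for illustration only.

```python
import math

HBAR = 1.0  # units with hbar = 1 (assumption for illustration)

def derivative(f, x, h=1e-5):
    """Central-difference approximation to df/dx."""
    return (f(x + h) - f(x - h)) / (2 * h)

def psi(x):
    """Hypothetical test wave function: a Gaussian."""
    return math.exp(-x * x / 2)

def commutator_PX(f, x):
    """([P, X] f)(x) with P = -i*hbar*d/dx and X = multiplication by x."""
    px = -1j * HBAR * derivative(lambda t: t * f(t), x)   # P acting on (X f)
    xp = x * (-1j * HBAR * derivative(f, x))              # X acting on (P f)
    return px - xp

x0 = 0.7
lhs = commutator_PX(psi, x0)      # numerical [P, X] psi at x0
rhs = -1j * HBAR * psi(x0)        # -i*hbar*psi at x0
```

The two values agree to within the finite-difference error, independent of the point `x0` chosen.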

These results regarding commutators provide a useful blueprint for measuring
quantities of interest. We will usually want our system to satisfy the
time-independent Schrödinger equation, which implies that the wave function is an
eigenfunction of H and represents a state of definite energy E. We will then want
to find a set of operators A, B, C, ..., which all commute with H and also with
each other:

$$[H, A] = 0$$
$$[H, B] = 0$$
$$[H, C] = 0$$

$$[A, B] = 0$$
$$[A, C] = 0$$
$$[B, C] = 0$$

The fact that all of these operators commute with each other means that we can
find a wave function $\psi$ for which

$$H\psi = E\psi$$
$$A\psi = a\psi$$
$$B\psi = b\psi$$

and so on. The set of eigenvalues E, a, b, etc. can be used to specify $\psi$ and are called
good quantum numbers. When a measurement of the corresponding observables
is made, the results will be E, a, b, ....






5.2 ■ VECTOR SPACES 

In this section we introduce the concept of an abstract “vector space.” First,
consider a familiar three-dimensional vector. It can be represented in one of two
equivalent ways: either as a quantity with a given magnitude and a direction in
three dimensions, or as a set of components, (x, y, z). Similarly, a two-dimensional
vector can be represented as a quantity with a given magnitude and direction in the
x-y plane, or as a two-component quantity, (x, y). It is the component
representation which we will generalize. One can specify a vector in n dimensions as a set of
n components: $(r_1, r_2, \ldots, r_n)$. Obviously, an n-dimensional vector cannot be
represented in ordinary three-dimensional space as a quantity with a magnitude and
direction, but any of the normal vector operations can be performed on it by using
its components. For instance, the sum and dot product of two three-dimensional
vectors $\mathbf{r} = (r_1, r_2, r_3)$ and $\mathbf{s} = (s_1, s_2, s_3)$ are just

$$\mathbf{r} + \mathbf{s} = (r_1 + s_1,\ r_2 + s_2,\ r_3 + s_3)$$

and

$$\mathbf{r} \cdot \mathbf{s} = r_1 s_1 + r_2 s_2 + r_3 s_3$$

In the same way, for two n-dimensional vectors, $\mathbf{r} = (r_1, r_2, \ldots, r_n)$ and
$\mathbf{s} = (s_1, s_2, \ldots, s_n)$, the sum and dot product are

$$\mathbf{r} + \mathbf{s} = (r_1 + s_1,\ r_2 + s_2,\ \ldots,\ r_n + s_n)$$

and

$$\mathbf{r} \cdot \mathbf{s} = r_1 s_1 + r_2 s_2 + \cdots + r_n s_n$$

A vector space is simply a collection of objects (called vectors) which act, in a
general way, like familiar three-dimensional vectors. For instance, the sum of two
vectors r and s must also be a vector:

$$\mathbf{r} + \mathbf{s} = \mathbf{t} \tag{5.9}$$

while the product of a number c and a vector r must also be a vector:

$$c\mathbf{r} = \mathbf{u} \tag{5.10}$$

Note that for our ordinary three-dimensional vectors, c can only be a real number,
but for some vector spaces c is assumed to be a complex number. Thus, there are
two different kinds of vector spaces: real vector spaces (for which c in Equation
(5.10) is restricted to be a real number) and complex vector spaces (for which c
can be complex). In quantum mechanics, we will deal exclusively with complex
vector spaces.




FIGURE 5.1 A plot of $r_i$ as a function of i for the three-dimensional vector r = (1, 4, 9).


The properties given in Equations (5.9) and (5.10) may seem almost trivial, but
they allow the notion of a vector to be generalized to a wide variety of other systems.
For example, consider the set of real numbers. Clearly, they obey Equations (5.9)
and (5.10) (as long as we restrict c to be a real number). Hence, the set of all real
numbers is a vector space with each real number acting as a vector. This result
becomes obvious when we realize that the real numbers are just equivalent to
one-component vectors, $(r_1)$; the set of real numbers forms a one-dimensional vector
space.

A less obvious result is that the set of functions f(x) can also be treated as a
vector space. Clearly, the sum of two functions f(x) and g(x) is also a function:

$$f(x) + g(x) = h(x)$$

and we can multiply a function by a real or complex number to get another function.
In fact, the set of functions behaves like an infinite-dimensional vector space! To
see this, consider first the three-dimensional vector r = (1, 4, 9). We can display
this vector in a rather strange way, plotting $r_i$ as a function of i (see Figure 5.1).
Similarly, an n-dimensional vector $(r_1, r_2, \ldots, r_n)$ can be displayed in a plot of $r_i$
as a function of i. An example for n = 7 is shown in Figure 5.2. Consider what
happens to Figure 5.2 as we take the limit $n \to \infty$. The points become denser,
merging into a continuous curve: a function! (See Figure 5.3.) This is a plausibility
argument rather than a rigorous proof, but it can be shown rigorously that a suitably
defined set of functions is, in fact, an infinite-dimensional vector space.

Inner Products 

As we have seen, one of the standard operations on a set of three-dimensional
vectors is the dot product: $\mathbf{r} \cdot \mathbf{s} = r_1 s_1 + r_2 s_2 + r_3 s_3$. It is trivial to generalize this
to other finite-dimensional vectors, but what happens when we have an
infinite-dimensional vector space, such as a set of functions?

FIGURE 5.2 A plot of $r_i$ as a function of i for a 7-dimensional vector $\mathbf{r} = (r_1, r_2, \ldots, r_7)$.

In this case we need to define a more abstract concept called an inner product.
Note that the dot product is a function which takes two vectors, r and s, and produces
a real number. Now suppose we have two vectors, $\psi$ and $\phi$, in an abstract vector
space. In analogy with the dot product, we take the inner product to be a function
which takes $\psi$ and $\phi$ as inputs and produces a real or complex number as the output.
(As already noted, quantum mechanics makes use of complex vector spaces, so
our inner products will produce complex numbers.) The inner product of $\psi$ and $\phi$
will be denoted $(\psi|\phi)$, such that

$$(\psi|\phi) = c$$

where c is a complex number. Note that, in the world of mathematics, there is
no standard notation for the inner product. In addition to the notation introduced
here, $(\psi, \phi)$, $\langle\psi, \phi\rangle$, and $\langle\psi|\phi\rangle$ are also used. Later, we will encounter yet another
notation for inner products, called the Dirac notation, which diverges in some
respects from the standard mathematical notation for inner products, but which is
ultimately equivalent to it.

If $\psi$, $\phi$, and $\theta$ are vectors in an arbitrary complex vector space, and c is a
complex number, then inner products have the following properties:

$$(\psi + \phi|\theta) = (\psi|\theta) + (\phi|\theta) \tag{5.11}$$

$$(\psi|c\phi) = c(\psi|\phi) \tag{5.12}$$

$$(\psi|\phi) = (\phi|\psi)^* \tag{5.13}$$

$$(\psi|\psi) \geq 0 \tag{5.14}$$

where, as usual, the * denotes complex conjugation. Note that Equations (5.12)
and (5.13) together imply that

$$(c\psi|\phi) = c^*(\psi|\phi) \tag{5.15}$$





In physics, the standard convention is given by Equations (5.12) and (5.15);
mathematicians, on the other hand, use the reverse convention: $(c\psi|\phi) = c(\psi|\phi)$ and
$(\psi|c\phi) = c^*(\psi|\phi)$. Needless to say, we will use the physics convention
throughout.

For ordinary real three-dimensional vectors, the dot product satisfies Equations
(5.11)-(5.14) and is therefore an inner product (see Exercise 5.3). However, we
are most interested in the vector space of functions, so we need to define an
inner product for this case. We argued at the beginning of this section that a
function resembles an n-dimensional vector in the limit where n goes to infinity.
We can use this analogy to derive a reasonable inner product for functions. For
two n-dimensional vectors, $\mathbf{r} = (r_1, r_2, \ldots, r_n)$ and $\mathbf{s} = (s_1, s_2, \ldots, s_n)$, the
n-dimensional inner product is $\mathbf{r} \cdot \mathbf{s} = r_1 s_1 + r_2 s_2 + \cdots + r_n s_n$. Now consider two
real-valued functions, f(x) and g(x). The quantity analogous to the n-dimensional
inner product is $f(x_1)g(x_1) + f(x_2)g(x_2) + \cdots + f(x_n)g(x_n)$, in the limit where
$n \to \infty$. Taking the continuum limit, the inner product should be $\int f(x)g(x)\,dx$.
For complex-valued functions, in order to satisfy Equations (5.13) and (5.14), the
inner product becomes $\int f(x)^* g(x)\,dx$.

More rigorously, consider the set of complex-valued functions in three
dimensions. For any two such functions, $f(\mathbf{r})$ and $g(\mathbf{r})$, the inner product will be defined
as

$$(f|g) = \int f(\mathbf{r})^* g(\mathbf{r})\,d^3r \tag{5.16}$$

where the integral is taken over all of three-dimensional space. For the frequently
encountered case of functions in one dimension, this reduces to

$$(f|g) = \int_{-\infty}^{\infty} f(x)^* g(x)\,dx \tag{5.17}$$

Although we used a rough analogy to finite-dimensional vectors to motivate these
expressions, these definitions do not depend on this argument. All that is
necessary for our expressions in Equations (5.16) and (5.17) to represent valid inner
products is that they satisfy Equations (5.11)-(5.14). This is indeed the case (see
Exercise 5.3).


Example 5.4. An Inner Product. 

Consider two functions, $\psi(x) = e^{-x^2/2}$ and $\phi(x) = xe^{-x^2/2}$. What is $(\psi|\phi)$?
We have

$$(\psi|\phi) = \int_{x=-\infty}^{\infty} \psi(x)^*\phi(x)\,dx$$

$$= \int_{x=-\infty}^{\infty} \left(e^{-x^2/2}\right)^* xe^{-x^2/2}\,dx$$

$$= \int_{x=-\infty}^{\infty} xe^{-x^2}\,dx$$

but note that $xe^{-x^2}$ is an odd function [i.e., a function for which $f(-x) = -f(x)$],
so the integral from $-\infty$ to 0 cancels the integral from 0 to $\infty$. Hence $(\psi|\phi) = 0$.
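The result of Example 5.4 can be checked numerically by going back to the discrete picture: sample the functions on a fine grid, sum the products, and multiply by the grid spacing. The sketch below does this with a midpoint rule; the function name `inner_product`, the cutoff at x = ±10, and the grid size are assumptions chosen for illustration.

```python
import math

def inner_product(f, g, lo=-10.0, hi=10.0, n=20000):
    """Midpoint-rule approximation to the integral of conj(f(x)) * g(x) dx."""
    dx = (hi - lo) / n
    total = 0.0
    for k in range(n):
        x = lo + (k + 0.5) * dx
        total += complex(f(x)).conjugate() * g(x) * dx
    return total

def psi(x):
    return math.exp(-x * x / 2)

def phi(x):
    return x * math.exp(-x * x / 2)

overlap = inner_product(psi, phi)   # odd integrand: approximately 0
norm_sq = inner_product(psi, psi)   # integral of exp(-x^2): approximately sqrt(pi)
```

The second value shows that $\psi$ as written in the example is not normalized; dividing it by $\pi^{1/4}$ would give $(\psi|\psi) = 1$.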


Some of the results from Chapter 3 can now be cast in a much more compact
form, using inner product notation. For instance, the requirement that the wave
function $\psi$ be normalized can now be written as

$$(\psi|\psi) = 1$$

In analogy with three-dimensional vectors, normalization means that the “length”
of the infinite-dimensional vectors is 1.

Similarly, the expectation value of an operator O can now be expressed in the
compact form:

$$\langle o \rangle = (\psi|O\psi)$$

Adjoint and Hermitian Operators 

The inner product derived in the previous section can be used to pair up every
operator with a second operator called its adjoint operator. If A is an operator,
then the adjoint operator of A is written as $A^\dagger$, and it satisfies the equation

$$(\phi|A\psi) = (A^\dagger\phi|\psi) \tag{5.18}$$

for all $\phi$ and $\psi$.


Example 5.5. The Adjoint of the Derivative Operator. 

Consider the one-dimensional derivative operator D; what is its adjoint?
We can write, for arbitrary $\phi$ and $\psi$,

$$(\phi|D\psi) = \int_{x=-\infty}^{\infty} \phi(x)^*\frac{d\psi}{dx}\,dx$$

Integration by parts gives

$$(\phi|D\psi) = \phi(x)^*\psi(x)\Big|_{-\infty}^{\infty} - \int_{x=-\infty}^{\infty} \frac{d\phi^*}{dx}\psi(x)\,dx \tag{5.19}$$

If $\phi$ and $\psi$ represent wave functions for physical particles, we can assume that
$\phi \to 0$ and $\psi \to 0$ as $x \to \pm\infty$, so the first term in Equation (5.19) vanishes, and
the second term simplifies to

$$(\phi|D\psi) = (-D\phi|\psi)$$

Hence, from the definition of the adjoint operator, $D^\dagger = -D$.
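The relation between D and its adjoint can be spot-checked numerically. The following sketch approximates D by a central difference and the inner product by a midpoint sum, then compares $(\phi|D\psi)$ with $(-D\phi|\psi)$. The two test functions are hypothetical choices that decay at large |x|, as the example requires; the helper names are likewise assumptions.

```python
import math

def derivative(f, x, h=1e-5):
    """Central-difference approximation to df/dx (works for complex values)."""
    return (f(x + h) - f(x - h)) / (2 * h)

def inner_product(f, g, lo=-10.0, hi=10.0, n=4000):
    """Midpoint-rule approximation to the integral of conj(f(x)) * g(x) dx."""
    dx = (hi - lo) / n
    total = 0j
    for k in range(n):
        x = lo + (k + 0.5) * dx
        total += complex(f(x)).conjugate() * g(x) * dx
    return total

# Hypothetical test functions that vanish as x -> +/- infinity.
def phi(x):
    return (1 + 1j * x) * math.exp(-x * x / 2)

def psi(x):
    return math.exp(-(x - 1) ** 2 / 2)

def D(f):
    """Return the function x -> (D f)(x)."""
    return lambda x: derivative(f, x)

lhs = inner_product(phi, D(psi))                  # (phi | D psi)
rhs = inner_product(lambda x: -D(phi)(x), psi)    # (-D phi | psi)
```

A complex-valued `phi` is used deliberately, so that the complex conjugate in the inner product actually matters in the comparison.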


From the definition of the adjoint operator (Equation 5.18), the following
general properties can be derived:

$$(cP)^\dagger = c^*P^\dagger \tag{5.20}$$

$$(P + Q)^\dagger = P^\dagger + Q^\dagger$$

$$(PQ)^\dagger = Q^\dagger P^\dagger \tag{5.21}$$

$$(P^\dagger)^\dagger = P$$

where c is a complex number, and P and Q are arbitrary operators. The only
nonintuitive result here is the reversal in the order of the operators P and Q in
Equation (5.21); this arises because of the way that operators are “peeled away”
to form the adjoint:

$$(\phi|P[Q\psi]) = (P^\dagger\phi|Q\psi)$$

$$= (Q^\dagger[P^\dagger\phi]|\psi)$$

so $(PQ)^\dagger = Q^\dagger P^\dagger$.

Note that for the operator corresponding to multiplication by the complex
number c, Equation (5.20) implies that

$$c^\dagger = c^*$$

It is possible for an operator to be equal to its own adjoint, i.e., $O^\dagger = O$. Such
operators are called self-adjoint or Hermitian, and they occupy a special place in
quantum mechanics.


Example 5.6. The Position Operator Is Hermitian. 

As an example, we now show that the one-dimensional position operator X is 
Hermitian. 

We can write, for arbitrary $\phi$ and $\psi$,

$$(\phi|X\psi) = \int_{x=-\infty}^{\infty} \phi(x)^* x\psi(x)\,dx$$

$$= \int_{x=-\infty}^{\infty} [x\phi(x)]^*\psi(x)\,dx$$

$$= (X\phi|\psi)$$

so $X^\dagger = X$, and X is Hermitian.


The reason that Hermitian operators are important in quantum mechanics is 
that both the expectation values and the eigenvalues of Hermitian operators are 
real. Since observable quantities are always real, we require that the operators 
corresponding to observables be Hermitian. 

Consider first the expectation value of a Hermitian operator Q: $\langle q \rangle = (\psi|Q\psi)$.
In order for $\langle q \rangle$ to be real, it must equal its own complex conjugate. From Equation
(5.13), we have

$$\langle q \rangle^* = (Q\psi|\psi)$$

From the definition of the adjoint operator, this is equivalent to

$$\langle q \rangle^* = (\psi|Q^\dagger\psi)$$

and, since Q is Hermitian, we get

$$\langle q \rangle^* = (\psi|Q\psi) = \langle q \rangle$$

Hence, $\langle q \rangle$ is real.

Now suppose that $\psi$ is an eigenfunction of a Hermitian operator Q with
eigenvalue q. Since Q is Hermitian,

$$(\psi|Q\psi) = (Q\psi|\psi)$$

which implies

$$q(\psi|\psi) = q^*(\psi|\psi)$$
$$q = q^*$$

so q must be real. Hence, Hermitian operators have both real expectation values and
real eigenvalues. We have already shown that the position operator is Hermitian;
Exercise 5.7 will show that several other operators corresponding to observables
are also Hermitian.
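Both conclusions can be illustrated with a finite-dimensional example. The sketch below uses a hypothetical 2x2 Hermitian matrix `Q` and a hypothetical normalized vector `psi` (neither is taken from the text) and checks that the expectation value and both eigenvalues come out real.

```python
import cmath

# Hypothetical 2x2 Hermitian matrix standing in for an observable Q.
Q = [[2.0, 1 - 1j],
     [1 + 1j, -1.0]]

def dagger(m):
    """Conjugate transpose (adjoint) of a 2x2 matrix."""
    return [[complex(m[j][i]).conjugate() for j in range(2)] for i in range(2)]

def expectation(m, v):
    """(v | M v), physics convention: the left-hand vector is conjugated."""
    mv = [m[i][0] * v[0] + m[i][1] * v[1] for i in range(2)]
    return sum(complex(v[i]).conjugate() * mv[i] for i in range(2))

def eigenvalues(m):
    """Roots of the characteristic polynomial of a 2x2 matrix."""
    tr = m[0][0] + m[1][1]
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    disc = cmath.sqrt(tr * tr - 4 * det)
    return (tr + disc) / 2, (tr - disc) / 2

psi = [0.6, 0.8j]              # normalized: 0.36 + 0.64 = 1
q_mean = expectation(Q, psi)   # imaginary part vanishes
q1, q2 = eigenvalues(Q)        # (1 +/- sqrt(17)) / 2, both real
```

Replacing one off-diagonal entry so that `dagger(Q) != Q` generally produces a complex expectation value, which is why non-Hermitian operators cannot represent observables.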

Basis Sets 

Among the collection of all three-dimensional vectors, there are three vectors that
occupy a special place: the unit vectors in the x, y, and z directions, denoted $\hat{x}$, $\hat{y}$,
and $\hat{z}$. This collection of three vectors $\hat{x}$, $\hat{y}$, and $\hat{z}$ has several important properties.
Any three-dimensional vector r can be expressed as a linear combination of $\hat{x}$, $\hat{y}$,
and $\hat{z}$, i.e., as the sum of the product of each of these three vectors with a different
real number:

$$\mathbf{r} = c_1\hat{x} + c_2\hat{y} + c_3\hat{z} \tag{5.22}$$

This result indicates that we have “enough” vectors to do the job of decomposing
every three-dimensional vector. However, we also don’t have “too many” vectors
in the sense that no one vector in our set of three can be expressed in terms of the
other two. One can never write, for example,

$$\hat{z} = c_1\hat{x} + c_2\hat{y} \tag{5.23}$$

If $\hat{z}$ could be expressed in this way, then $\hat{z}$ would be irrelevant; Equation (5.23) could
just be substituted into Equation (5.22) and $\hat{z}$ could be eliminated from Equation
(5.22). In an abstract vector space, a subset of vectors with these two properties
(every vector can be represented as a linear combination of the vectors in the
subset, and no one vector in the subset can be expressed as a linear combination
of the rest) is called a basis. In general, an n-dimensional vector space will have n
distinct basis vectors.

The basis set $\hat{x}$, $\hat{y}$, and $\hat{z}$ has two other desirable properties. First of all, each
vector has unit length, and second, each vector is perpendicular to the other two:
$\hat{x} \cdot \hat{y} = \hat{x} \cdot \hat{z} = \hat{y} \cdot \hat{z} = 0$. A basis with these two additional properties is called an
orthonormal basis.

In general, we can find a basis set for any vector space, but the basis will
not be unique. In three dimensions, for example, there are an infinite number of
orthonormal bases, obtained by rotating the $\hat{x}$, $\hat{y}$, $\hat{z}$ basis. For instance, another
perfectly acceptable orthonormal basis is $(1/\sqrt{2})(\hat{x} + \hat{y})$, $(1/\sqrt{2})(-\hat{x} + \hat{y})$, $\hat{z}$.

Even infinite-dimensional vector spaces have basis sets. As an example,
consider the set of functions f(x) which are periodic with period $2\pi$. Thus, for these
functions, $f(x + 2\pi) = f(x)$. (The inner product for this vector space will be
defined by integration from 0 to $2\pi$ rather than $-\infty$ to $\infty$.) Since this is an
infinite-dimensional vector space, its basis set will contain an infinite set of functions.
A familiar example of a basis for this vector space is the set of trigonometric
functions:

$$\frac{1}{\sqrt{\pi}}\sin x,\ \frac{1}{\sqrt{\pi}}\sin(2x),\ \ldots,\ \frac{1}{\sqrt{\pi}}\sin(nx),\ \ldots$$

$$\frac{1}{\sqrt{\pi}}\cos x,\ \frac{1}{\sqrt{\pi}}\cos(2x),\ \ldots,\ \frac{1}{\sqrt{\pi}}\cos(nx),\ \ldots$$

Any periodic function can be written as a sum of these functions:

$$f(x) = \sum_{n=1}^{\infty}\frac{A_n}{\sqrt{\pi}}\sin(nx) + \sum_{n=0}^{\infty}\frac{B_n}{\sqrt{\pi}}\cos(nx) \tag{5.24}$$

called a Fourier series. Further, this set of trigonometric functions forms an
orthonormal basis, since

$$\left(\frac{1}{\sqrt{\pi}}\sin mx\,\middle|\,\frac{1}{\sqrt{\pi}}\sin nx\right) = \left(\frac{1}{\sqrt{\pi}}\cos mx\,\middle|\,\frac{1}{\sqrt{\pi}}\cos nx\right) = \begin{cases} 0, & m \neq n \\ 1, & m = n \end{cases}$$

and

$$\left(\frac{1}{\sqrt{\pi}}\sin mx\,\middle|\,\frac{1}{\sqrt{\pi}}\cos nx\right) = 0$$

for all m and n.

Although any periodic function can be written as the sum of trigonometric basis
functions, as in Equation (5.24), we need a procedure for determining the constants
$A_n$ and $B_n$ which multiply these trigonometric functions. Again, an analogy to
three-dimensional vectors is instructive. Consider an arbitrary three-dimensional
vector r, expanded out as

$$\mathbf{r} = c_1\hat{x} + c_2\hat{y} + c_3\hat{z} \tag{5.25}$$

and suppose that we need to determine $c_1$, $c_2$, and $c_3$. Since our basis is orthonormal,
we can take the dot product of r with $\hat{x}$ to obtain

$$\mathbf{r} \cdot \hat{x} = c_1(\hat{x} \cdot \hat{x}) + c_2(\hat{y} \cdot \hat{x}) + c_3(\hat{z} \cdot \hat{x})$$

$$= c_1(1) + c_2(0) + c_3(0)$$

$$= c_1$$

So $c_1 = \mathbf{r} \cdot \hat{x}$. Similarly, $c_2 = \mathbf{r} \cdot \hat{y}$ and $c_3 = \mathbf{r} \cdot \hat{z}$. This result shows that $c_1$, $c_2$, and
$c_3$ are the magnitudes of the projections of r onto the x, y, and z axes, respectively.
Therefore, an alternate way of writing Equation (5.25) is

$$\mathbf{r} = (\mathbf{r} \cdot \hat{x})\hat{x} + (\mathbf{r} \cdot \hat{y})\hat{y} + (\mathbf{r} \cdot \hat{z})\hat{z} \tag{5.26}$$
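The projection formula holds in any orthonormal basis, not only along the coordinate axes. As a small sketch, the code below decomposes a vector in the rotated basis quoted earlier in this section and rebuilds it from its projections. The vector r = (1, 4, 9) is reused from Figure 5.1 purely for illustration, and the helper names are assumptions.

```python
import math

def dot(u, v):
    """Ordinary dot product of two real vectors."""
    return sum(a * b for a, b in zip(u, v))

s = 1 / math.sqrt(2)
# Rotated orthonormal basis: (x + y)/sqrt(2), (-x + y)/sqrt(2), z.
basis = [(s, s, 0.0), (-s, s, 0.0), (0.0, 0.0, 1.0)]

r = (1.0, 4.0, 9.0)                    # vector to decompose
coeffs = [dot(r, e) for e in basis]    # c_i = r . e_i
rebuilt = tuple(sum(c * e[k] for c, e in zip(coeffs, basis)) for k in range(3))
```

The coefficients differ from (1, 4, 9) because the basis differs, but summing coefficient times basis vector recovers the same r.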


Now suppose we take an arbitrary periodic function f(x) and wish to expand
it out in the form of Equation (5.24). In analogy to Equation (5.26), we write

$$\begin{aligned}
f(x) ={} & \left(f\,\middle|\,\tfrac{1}{\sqrt{\pi}}\sin x\right)\tfrac{1}{\sqrt{\pi}}\sin x + \left(f\,\middle|\,\tfrac{1}{\sqrt{\pi}}\sin 2x\right)\tfrac{1}{\sqrt{\pi}}\sin 2x + \cdots \\
& + \left(f\,\middle|\,\tfrac{1}{\sqrt{\pi}}\sin nx\right)\tfrac{1}{\sqrt{\pi}}\sin nx + \cdots + \left(f\,\middle|\,\tfrac{1}{\sqrt{\pi}}\cos x\right)\tfrac{1}{\sqrt{\pi}}\cos x \\
& + \left(f\,\middle|\,\tfrac{1}{\sqrt{\pi}}\cos 2x\right)\tfrac{1}{\sqrt{\pi}}\cos 2x + \cdots + \left(f\,\middle|\,\tfrac{1}{\sqrt{\pi}}\cos nx\right)\tfrac{1}{\sqrt{\pi}}\cos nx + \cdots
\end{aligned}$$

Thus, the Fourier coefficients $A_n$ and $B_n$ in Equation (5.24) are the inner products
of f(x) with the appropriate basis functions, e.g.,

$$A_n = \left(f\,\middle|\,\tfrac{1}{\sqrt{\pi}}\sin nx\right)$$

and

$$B_n = \left(f\,\middle|\,\tfrac{1}{\sqrt{\pi}}\cos nx\right)$$

Of course, the trigonometric functions are not the only possible basis set, and this
is where the connection to quantum mechanics becomes relevant. Recall that for an
arbitrary potential V(x), we can find a set of solutions $\psi_n(x)$ for the Schrödinger
equation. This set of solutions will itself form a basis set, so that an arbitrary
function can be expressed as a linear combination of these solutions:

$$f(x) = \sum_n c_n\psi_n(x)$$

The usefulness of this sort of expansion will become apparent later.
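The idea that expansion coefficients are inner products with basis functions can be tried out on the trigonometric basis of this section. The sketch below is an illustration under stated assumptions: the test function `f` is a hypothetical real, $2\pi$-periodic function whose coefficients are known in advance, and the integral over one period is approximated by a midpoint sum.

```python
import math

def inner_product(f, g, n=20000):
    """Midpoint-rule integral of f(x) * g(x) from 0 to 2*pi (real functions only)."""
    dx = 2 * math.pi / n
    return sum(f((k + 0.5) * dx) * g((k + 0.5) * dx) for k in range(n)) * dx

def sin_basis(n):
    """Normalized basis function sin(n x) / sqrt(pi)."""
    return lambda x: math.sin(n * x) / math.sqrt(math.pi)

def cos_basis(n):
    """Normalized basis function cos(n x) / sqrt(pi)."""
    return lambda x: math.cos(n * x) / math.sqrt(math.pi)

def f(x):
    """Hypothetical periodic test function with known coefficients."""
    return 3 * math.sin(x) + 0.5 * math.cos(2 * x)

A1 = inner_product(f, sin_basis(1))   # 3 * sqrt(pi)
B2 = inner_product(f, cos_basis(2))   # 0.5 * sqrt(pi)
A2 = inner_product(f, sin_basis(2))   # 0: f has no sin(2x) component
```

Dividing `A1` and `B2` by $\sqrt{\pi}$ recovers the amplitudes 3 and 0.5 that were put into `f`, in line with the factors of $1/\sqrt{\pi}$ in Equation (5.24).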


EXERCISES 

5.1 Verify the commutator properties given in Equations (5.4)-(5.7), i.e., for any operators
A, B, and C, show that

$$[A, B] = -[B, A]$$

$$[A, A] = 0$$

$$[A + B, C] = [A, C] + [B, C]$$

$$[A, BC] = [A, B]C + B[A, C]$$

5.2 Consider a particle moving in three dimensions. Is it possible for the particle to be in
a state of definite $p_x$ and y, i.e., can both its y-coordinate and its momentum in the x
direction be known at the same time?

5.3 (a) Verify that the ordinary dot product for real three-dimensional vectors satisfies
all of the properties of an inner product, given by Equations (5.11)-(5.14).

(b) Verify that the inner product for complex-valued, three-dimensional functions
defined in Equation (5.16) satisfies Equations (5.11)-(5.14).

5.4 (a) The operators A, B, and C are all Hermitian with [A, B] = C. Show that C = 0.

(b) The operators A and B are both Hermitian with $[A, B] = i\hbar$. Determine whether
or not AB is a Hermitian operator.

5.5 The one-dimensional parity operator $\Pi$ is defined by $\Pi\psi(x) = \psi(-x)$. In other
words, $\Pi$ changes x into −x everywhere in the function.

(a) Is $\Pi$ a Hermitian operator?

(b) For what potentials V(x) is it possible to find a set of wave functions which are
eigenfunctions of the parity operator and solutions of the one-dimensional
time-independent Schrödinger equation?

5.6 (a) Let Q be an operator which is not a function of time, and let H be the Hamiltonian
operator. Show that

$$\frac{\partial\langle q \rangle}{\partial t} = \frac{1}{i\hbar}\langle [Q, H] \rangle$$

Here $\langle q \rangle$ is the expectation value of Q for an arbitrary time-dependent wave
function $\Psi$ which is not necessarily an eigenfunction of H, and $\langle [Q, H] \rangle$ is the
expectation value of the commutator of Q and H for the same wave function.
This result is known as Ehrenfest's theorem.





(b) Use this result to show that 




What is the classical analog of this equation? 


5.7 (a) Show that the one-dimensional momentum operator is Hermitian.

(b) Use this result to show that the one-dimensional Hamiltonian operator H with
potential V(x) is Hermitian. What (reasonable) assumption must be made about
V(x) to derive this result?

5.8 Suppose that the operator T is defined by $T = aQ^\dagger Q$, where a is a real number, and
Q is an operator (not necessarily Hermitian). Show that T is Hermitian.

5.9 Determine all potentials V(x) for which it is possible to find a set of solutions of the
time-independent Schrödinger equation which are also eigenfunctions of the position
operator X, or else show that no such potentials exist.

5.10 Suppose that two operators P and Q satisfy the commutation relation

$$[P, Q] = Q$$

Suppose that $\psi$ is an eigenfunction of the operator P with eigenvalue p. Show that
$Q\psi$ is also an eigenfunction of P, and find its eigenvalue.

5.11 The operator F is defined by $F\psi(x) = \psi(x + a) + \psi(x - a)$, where a is a nonzero
constant. Determine whether or not F is a Hermitian operator.




CHAPTER 

6 


Solutions of the 
Three-Dimensional 
Time-Independent Schrodinger 

Equation 


In Chapter 4 we examined solutions of the one-dimensional time-independent
Schrödinger equation,

$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + V(x)\psi = E\psi$$

Of course, the real world is three-dimensional, so in this chapter we will solve the
full, three-dimensional time-independent Schrödinger equation. Along the way, we
will need to understand the behavior of angular momentum in quantum mechanics.
The crowning achievement of the solution of the three-dimensional Schrödinger
equation (and of this chapter) will be a description of the hydrogen atom.

The three-dimensional Schrödinger equation is

$$-\frac{\hbar^2}{2m}\nabla^2\Psi(\mathbf{r}, t) + V(\mathbf{r})\Psi(\mathbf{r}, t) = i\hbar\frac{\partial\Psi}{\partial t}$$

We will assume throughout this chapter that the wave function represents a state
of definite energy E, so that the Schrödinger equation can be written in the
time-independent form:

$$-\frac{\hbar^2}{2m}\nabla^2\psi(\mathbf{r}) + V(\mathbf{r})\psi(\mathbf{r}) = E\psi(\mathbf{r}) \tag{6.1}$$

In order to solve Equation (6.1), we need to choose a particular coordinate
system, e.g., rectangular, cylindrical, or spherical. Of course, the physical solution
does not depend on the coordinate system we choose; instead, the correct choice
of coordinates can simplify the form of the solution by taking advantage of the
symmetries in the problem. For example, a central force, defined as a force which
points radially inward or outward, corresponds to a potential that is a function only
of radial distance r. (An example is the potential experienced by an electron in
a hydrogen atom.) For such a potential, the spherical coordinate system is most
appropriate. However, as a warm-up, we will first examine a problem which can
be solved in a simple way in rectangular coordinates.





FIGURE 6.1 A particle of mass m is confined to a rectangular box with sides of length 
a, b, and c. 


6.1 ■ SOLUTION IN RECTANGULAR COORDINATES 

Consider a particle of mass m, confined to move in a rectangular box with sides
of length a, b, and c (Figure 6.1). We choose a rectangular coordinate system with
x along the side of length a, y along the side of length b, and z along the side of
length c, and one corner of the box at the origin. Then the potential is simply

$$V(\mathbf{r}) = 0, \qquad 0 < x < a, \quad 0 < y < b, \quad 0 < z < c$$

with infinite potential barriers at x = 0, x = a, y = 0, y = b, and z = 0, z = c.
This is the three-dimensional analog of the infinite one-dimensional square-well
potential discussed in Chapter 4.

Inside the box, where V = 0, Equation (6.1) becomes

$$-\frac{\hbar^2}{2m}\nabla^2\psi(\mathbf{r}) = E\psi(\mathbf{r})$$

which can be written as

$$\frac{\partial^2\psi}{\partial x^2} + \frac{\partial^2\psi}{\partial y^2} + \frac{\partial^2\psi}{\partial z^2} = -\frac{2mE}{\hbar^2}\psi \tag{6.2}$$


As in Chapter 3, we use separation of variables. (The solution here is slightly more
complicated than the separation of variables solutions in Chapter 3, since we now
have three independent variables.) We take a trial solution of the form

$$\psi(x, y, z) = \psi_1(x)\psi_2(y)\psi_3(z) \tag{6.3}$$

In this equation, $\psi_1(x)$ is an unknown function of x and is independent of y and
z. Similarly, $\psi_2(y)$ is an unknown function of y which is independent of x and z,
and so on. Our job is then to find the functions $\psi_1$, $\psi_2$, and $\psi_3$.




Substituting Equation (6.3) into Equation (6.2) gives

$$\frac{\partial^2\psi_1(x)}{\partial x^2}\psi_2(y)\psi_3(z) + \psi_1(x)\frac{\partial^2\psi_2(y)}{\partial y^2}\psi_3(z) + \psi_1(x)\psi_2(y)\frac{\partial^2\psi_3(z)}{\partial z^2} = -\frac{2mE}{\hbar^2}\psi_1(x)\psi_2(y)\psi_3(z)$$

and dividing both sides by $\psi_1(x)\psi_2(y)\psi_3(z)$ yields

$$\frac{1}{\psi_1(x)}\frac{\partial^2\psi_1(x)}{\partial x^2} + \frac{1}{\psi_2(y)}\frac{\partial^2\psi_2(y)}{\partial y^2} + \frac{1}{\psi_3(z)}\frac{\partial^2\psi_3(z)}{\partial z^2} = -\frac{2mE}{\hbar^2} \tag{6.4}$$

Now consider what happens if we move the second and third terms on the left-hand
side over to the right-hand side:

$$\frac{1}{\psi_1(x)}\frac{\partial^2\psi_1(x)}{\partial x^2} = -\frac{2mE}{\hbar^2} - \frac{1}{\psi_2(y)}\frac{\partial^2\psi_2(y)}{\partial y^2} - \frac{1}{\psi_3(z)}\frac{\partial^2\psi_3(z)}{\partial z^2} \tag{6.5}$$

The left-hand side of this equation is a function only of x and is independent
of y and z. On the other hand, the right-hand side is a function only of y and z
and is independent of x. There is only one function that satisfies both of these
requirements: a constant, which is independent of x, y, and z. Hence, we can set
both the left-hand side and the right-hand side of Equation (6.5) equal to some
(still to be determined) constant, which we will call $C_x$:

$$\frac{1}{\psi_1(x)}\frac{d^2\psi_1(x)}{dx^2} = C_x \tag{6.6}$$

But now note that we can again begin with Equation (6.4) and, instead of leaving
the first term on the left-hand side, we can leave the second or third terms. Leaving
the second term on the left-hand side, a similar argument gives

$$\frac{1}{\psi_2(y)}\frac{d^2\psi_2(y)}{dy^2} = C_y \tag{6.7}$$

where $C_y$ is another undetermined constant, while leaving the third term on the
left-hand side produces

$$\frac{1}{\psi_3(z)}\frac{d^2\psi_3(z)}{dz^2} = C_z \tag{6.8}$$

Thus, we have transformed a single partial differential equation (Equation 6.4)
into three ordinary differential equations. Note that $C_x$, $C_y$, and $C_z$ are not
completely independent. Adding Equations (6.6), (6.7), and (6.8), and comparing with
Equation (6.4), we get

$$C_x + C_y + C_z = -\frac{2mE}{\hbar^2}$$
2 





It is possible to put Equations (6.6)-(6.8) into a form that we have already seen.
Define the new constants $E_x$, $E_y$, and $E_z$ to be given by $C_x = -2mE_x/\hbar^2$,
$C_y = -2mE_y/\hbar^2$, and $C_z = -2mE_z/\hbar^2$. Note that $E_x$, $E_y$, and $E_z$ have no physical
significance, but their sum does; it is the total energy:

$$E_x + E_y + E_z = E$$

If we rewrite Equations (6.6)-(6.8) in terms of $E_x$, $E_y$, and $E_z$, instead of $C_x$, $C_y$,
and $C_z$, we get the three equations

$$\frac{d^2\psi_1(x)}{dx^2} + \frac{2mE_x}{\hbar^2}\psi_1(x) = 0 \tag{6.9}$$

$$\frac{d^2\psi_2(y)}{dy^2} + \frac{2mE_y}{\hbar^2}\psi_2(y) = 0 \tag{6.10}$$

$$\frac{d^2\psi_3(z)}{dz^2} + \frac{2mE_z}{\hbar^2}\psi_3(z) = 0 \tag{6.11}$$

supplemented by the boundary condition that the wave function must vanish on
the sides of the box, which gives $\psi_1(x) = 0$ at x = 0 and x = a, $\psi_2(y) = 0$ at
y = 0 and y = b, and $\psi_3(z) = 0$ at z = 0 and z = c. But we have seen equations
of this form before. Equations (6.9), (6.10), and (6.11) all have the form of the
Schrödinger equation for a one-dimensional infinite square well (Equation 4.28)
with the same boundary conditions as the infinite one-dimensional square well.
Hence, the solutions to these equations are the same as the solutions previously
derived in Chapter 4, namely,

$$\psi_1(x) \propto \sin\left(\frac{n_x\pi x}{a}\right)$$

with corresponding energy

$$E_x = \frac{\hbar^2\pi^2}{2ma^2}n_x^2$$

and similarly for $\psi_2(y)$ and $\psi_3(z)$. Using Equation (6.3) to reassemble the wave
function, we get

$$\psi(x, y, z) = A\sin\left(\frac{n_x\pi x}{a}\right)\sin\left(\frac{n_y\pi y}{b}\right)\sin\left(\frac{n_z\pi z}{c}\right)$$

where $n_x$, $n_y$, and $n_z$ can each take on positive integer values, and A is the
normalization constant. (This normalization constant is $A = \sqrt{8/V}$, where V is the
volume of the box; see Exercise 6.1.) The energy corresponding to a given $n_x$, $n_y$,
and $n_z$ is

$$E = E_x + E_y + E_z = \frac{\hbar^2\pi^2}{2m}\left(\frac{n_x^2}{a^2} + \frac{n_y^2}{b^2} + \frac{n_z^2}{c^2}\right)$$
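The separated solution can be checked directly against the equation inside the box, $\nabla^2\psi = -(2mE/\hbar^2)\psi$. The sketch below evaluates a finite-difference Laplacian of the product of sines at one interior point and compares it with the value predicted by $E = E_x + E_y + E_z$. The box dimensions, quantum numbers, and sample point are hypothetical values chosen for illustration, and the normalization constant is set to 1 since it cancels from the equation.

```python
import math

a, b, c = 1.0, 2.0, 3.0     # hypothetical box dimensions
nx, ny, nz = 1, 2, 3        # hypothetical quantum numbers

def psi(x, y, z):
    """Unnormalized product solution (A = 1)."""
    return (math.sin(nx * math.pi * x / a)
            * math.sin(ny * math.pi * y / b)
            * math.sin(nz * math.pi * z / c))

def laplacian(f, x, y, z, h=1e-4):
    """Second-order central-difference approximation to the Laplacian."""
    return (f(x + h, y, z) + f(x - h, y, z)
            + f(x, y + h, z) + f(x, y - h, z)
            + f(x, y, z + h) + f(x, y, z - h)
            - 6 * f(x, y, z)) / h ** 2

# 2mE/hbar^2 as predicted by E = E_x + E_y + E_z
k_squared = math.pi ** 2 * (nx ** 2 / a ** 2 + ny ** 2 / b ** 2 + nz ** 2 / c ** 2)

point = (0.3, 0.7, 1.1)            # an interior point of the box
lhs = laplacian(psi, *point)       # numerical Laplacian of psi
rhs = -k_squared * psi(*point)     # -(2mE/hbar^2) * psi
```

The same function also vanishes on the walls, e.g. `psi(0.0, 0.7, 1.1)` is zero and `psi(a, 0.7, 1.1)` is zero to rounding error, as the boundary conditions require.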


Consider what happens for the special case of a cube of side a. For this case,
the wave function with quantum numbers $n_x$, $n_y$, and $n_z$ is

$$\psi(x, y, z) = A\sin\left(\frac{n_x\pi x}{a}\right)\sin\left(\frac{n_y\pi y}{a}\right)\sin\left(\frac{n_z\pi z}{a}\right)$$

with corresponding energy levels

$$E = \frac{\hbar^2\pi^2}{2ma^2}\left(n_x^2 + n_y^2 + n_z^2\right)$$

Now we see an interesting new phenomenon. Consider the two states $n_x = 1$,
$n_y = 1$, $n_z = 2$ and $n_x = 1$, $n_y = 2$, $n_z = 1$. These correspond to two
different wave functions, but they have the same energy. This illustrates the phenomenon
of degeneracy. Two different states are said to be degenerate if they have the same
energy but different wave functions. (As noted in the previous chapter, degeneracy
in linear algebra occurs when two different eigenvectors have the same eigenvalue.
In this case the two different wave functions are both eigenfunctions of the
Hamiltonian H, and they have the same eigenvalue E.) Note that our two wave functions
in this case are related by an interchange of the y- and z-axes, which leaves the
potential unchanged. Degeneracies often arise from this sort of symmetry.
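The degeneracies of the cubic box can be enumerated by grouping the triples of quantum numbers according to the sum of their squares, which is the energy in units of $\hbar^2\pi^2/(2ma^2)$. The sketch below does this for a small range of quantum numbers; the function name `cube_levels` and the cutoff `n_max` are assumptions made for illustration.

```python
from collections import defaultdict
from itertools import product

def cube_levels(n_max):
    """Group states (nx, ny, nz) by nx^2 + ny^2 + nz^2, i.e. by the energy
    in units of hbar^2 pi^2 / (2 m a^2). Only quantum numbers up to n_max
    are included, so levels at or above n_max-dependent energies are incomplete."""
    levels = defaultdict(list)
    for state in product(range(1, n_max + 1), repeat=3):
        levels[sum(n * n for n in state)].append(state)
    return dict(sorted(levels.items()))

levels = cube_levels(3)
ground = levels[3]          # [(1, 1, 1)]: nondegenerate
first_excited = levels[6]   # (1, 1, 2), (1, 2, 1), (2, 1, 1): threefold degenerate
```

With `n_max = 3`, every level below 18 is complete, since any state containing a quantum number 4 has energy at least 16 + 1 + 1 in these units.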


6.2 ■ ANGULAR MOMENTUM 

Before moving on to examine quantum mechanical systems with spherical
symmetry, it is necessary to derive a quantum mechanical treatment of angular momentum.
Recall that for a classical particle with momentum p at position r relative to the
origin, the angular momentum is a vector given by the cross product of r and p
(Figure 6.2):

$$\mathbf{L} = \mathbf{r} \times \mathbf{p}$$

FIGURE 6.2 In classical mechanics, the angular momentum L for a particle with
momentum p at the position r relative to the origin is L = r × p.

We now need to derive a quantum mechanical operator corresponding to L. Note
that we already have an operator R corresponding to the position r, namely
multiplication by r, and we have an operator P corresponding to the momentum p,
namely $-i\hbar\nabla$. (We will use the lowercase symbols r and p to refer to the physical
position and momentum, and the uppercase symbols R and P to refer to the
corresponding operators.) Hence, the operator corresponding to angular momentum
should simply be

$$\mathbf{L} = \mathbf{R} \times \mathbf{P} \tag{6.12}$$

In practice, however, it is easier to break L down into its components, and to
calculate the operators corresponding to the x, y, and z components of angular
momentum, namely, $L_x$, $L_y$, and $L_z$. Then Equation (6.12) gives

$$L_x = YP_z - ZP_y$$
$$L_y = ZP_x - XP_z$$
$$L_z = XP_y - YP_x$$

All three of these operators are Hermitian.


Example 6.1. Show that the Operator $L_z$ is Hermitian.

The adjoint of $L_z$ is

$$L_z^\dagger = (XP_y)^\dagger - (YP_x)^\dagger$$

The rules for taking adjoints of sums and products of operators (Chapter 5) give

$$L_z^\dagger = P_y^\dagger X^\dagger - P_x^\dagger Y^\dagger$$

But the momentum and position operators are Hermitian, so that

$$L_z^\dagger = P_y X - P_x Y$$

Now recall that $P_y$ commutes with $X$, and $P_x$ commutes with $Y$, so that

$$L_z^\dagger = XP_y - YP_x = L_z$$

so $L_z$ is Hermitian. The argument is similar for the other components of L (Exercise 6.5).


Now consider how well we can measure the angular momentum of a particle.
Suppose that we have a particle for which we would like to measure the angular
momentum exactly. For this to be possible, the particle must be in an eigenstate of
each component of L, i.e., $L_x$, $L_y$, and $L_z$. However, this is possible only if $L_x$,
$L_y$, and $L_z$ all commute with each other. Consider, for example, whether or not
$L_x$ and $L_y$ commute with each other. We have

$$[L_x, L_y] = [YP_z - ZP_y,\; ZP_x - XP_z] \tag{6.13}$$

To simplify this expression, we need to know all commutators of the form
$[P_z, X]$, $[P_y, Z]$, and so on. In the previous chapter, we derived the commutator
for the one-dimensional momentum and position operators, which corresponds to

$$[X, P_x] = i\hbar$$

and similarly for the y and z components. But what happens if the position and
momentum in this equation correspond to different components, e.g., does $P_y$
commute with $X$? In general, all of the position and momentum operators commute
with each other as long as they correspond to different components. For instance,
for an arbitrary function $f$,

$$[X, P_y]f = x(-i\hbar)\frac{\partial f}{\partial y} - (-i\hbar)\frac{\partial}{\partial y}(xf) = 0$$

A shorthand for this result that encapsulates all of these commutation relations is

$$[R_\alpha, P_\beta] = i\hbar\,\delta_{\alpha\beta}$$

where x, y, and z correspond to $\alpha = 1, 2, 3$ and $\beta = 1, 2, 3$, and $\delta_{\alpha\beta}$ is called the
Kronecker delta; it has the value

$$\delta_{\alpha\beta} = \begin{cases} 1, & \alpha = \beta \\ 0, & \alpha \neq \beta \end{cases}$$

Furthermore, the different components of position all commute with each other,
as do the different components of momentum, i.e.,

$$[X, Y] = [X, Z] = [Y, Z] = 0$$

and

$$[P_x, P_y] = [P_x, P_z] = [P_y, P_z] = 0$$

(see Exercise 6.6).






The simplification of Equation (6.13) also requires the use of the identities (from 
Chapter 5), 


$$[A + B, C] = [A, C] + [B, C]$$

and

$$[AB, C] = A[B, C] + [A, C]B$$

These identities allow Equation (6.13) to be reduced to

$$[L_x, L_y] = [YP_z, ZP_x] - [ZP_y, ZP_x] - [YP_z, XP_z] + [ZP_y, XP_z]$$

for which the second and third terms are zero (since they contain only factors
which commute with each other), while the first and fourth terms reduce to

$$[L_x, L_y] = Y[P_z, Z]P_x + X[Z, P_z]P_y = i\hbar(XP_y - YP_x) = i\hbar L_z$$

Similarly,

$$[L_y, L_z] = i\hbar L_x$$

and

$$[L_z, L_x] = i\hbar L_y$$


Thus, none of the three components of L commutes with any of the others. This
means that we cannot measure the full angular momentum exactly; instead, we
can measure only a single component of the angular momentum! We will normally
take this component to be $L_z$, i.e., we will look for states which are eigenfunctions
of $L_z$. This choice is, however, arbitrary. It is a convenient choice because, in
spherical coordinates, the angular variable $\theta$ is normally measured relative to the
z-axis. But this does not mean that there is anything special about $L_z$: we could
just as easily have chosen to measure $L_x$ or $L_y$ instead.

Can we measure anything else about the angular momentum other than the value
of a single component? In fact, we can also measure the square of the magnitude of
the angular momentum. The corresponding operator is $L^2 = L_x^2 + L_y^2 + L_z^2$, and
this operator commutes with $L_z$:

$$\begin{aligned}
[L^2, L_z] &= [L_x^2 + L_y^2 + L_z^2,\; L_z] \\
&= [L_x^2, L_z] + [L_y^2, L_z] + [L_z^2, L_z]
\end{aligned}$$

The last term is zero, and the other terms give

$$\begin{aligned}
[L^2, L_z] &= L_x[L_x, L_z] + [L_x, L_z]L_x + L_y[L_y, L_z] + [L_y, L_z]L_y \\
&= -i\hbar L_xL_y - i\hbar L_yL_x + i\hbar L_yL_x + i\hbar L_xL_y \\
&= 0
\end{aligned}$$

This means that a particle can be in an eigenstate of $L^2$ and $L_z$ simultaneously,
i.e., we can measure the total magnitude squared of the angular momentum and
its component in the z direction.

But what result will we get if we actually do measure these two quantities?
The answer is given by the eigenvalues of $L^2$ and $L_z$. One might think that these
eigenvalues are fairly arbitrary and would depend on the particular wave function
(as is, for example, the case for the eigenvalues of $H$, which correspond to energy).
However, this is not the case. If $\psi$ is an eigenfunction of both $L^2$ and $L_z$, then the
eigenvalues of these operators are actually restricted to a small class of possible
values, which we will now calculate.

Before beginning this calculation, we need to point out that there are actually 
two kinds of angular momentum that are observed at the atomic level. The first 
is the familiar orbital angular momentum, which is equivalent, classically, to the 
orbit of a particle. An example of this is the angular momentum of the electron 
as it orbits the nucleus of an atom. However, particles, such as the electron and 
proton, also have an internal angular momentum called spin angular momentum. 
Naively, one can imagine these particles behaving like rotating balls. However, 
spin differs so much from our intuitive ideas of rotational motion that this analogy 
is quite crude. We will be mostly concerned with orbital angular momentum in 
this chapter and will deal with spin angular momentum in Chapter 8. However, the 
discussion which follows in this section will be kept as general as possible. When 
dealing with general angular momentum (as opposed to orbital or spin angular 
momentum), we will use the symbol J. Then L will always be used to refer to 
orbital angular momentum, and spin angular momentum will be denoted by the 
operator S. 

Assume, then, that the angular momentum operator J obeys the commutation
relations we derived above, namely,

$$\begin{aligned}
[J_x, J_y] &= i\hbar J_z \\
[J_z, J_x] &= i\hbar J_y \\
[J_y, J_z] &= i\hbar J_x
\end{aligned}$$

Furthermore, $J^2$ commutes with each of the individual operators, $J_x$, $J_y$, and $J_z$.
We will now assume that $\psi$ is an eigenfunction of $J^2$ and $J_z$, and we will calculate
the possible eigenvalues of these two operators.
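These commutation relations can be checked numerically on a concrete example. The sketch below uses the standard 2 x 2 spin-1/2 matrices, $J = (\hbar/2)\sigma$ with $\sigma$ the Pauli matrices; this representation is not derived in the text and is assumed here only as a test case. The helper names (`matmul`, `lincomb`, `commutator`, `is_close`, `scaled`) are hypothetical, and units with $\hbar = 1$ are used.

```python
HBAR = 1.0  # units with hbar = 1 (an assumption of this sketch)

def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def lincomb(ca, a, cb, b):
    """Return ca*a + cb*b, element by element."""
    n = len(a)
    return [[ca * a[i][j] + cb * b[i][j] for j in range(n)] for i in range(n)]

def scaled(c, a):
    """Return c*a."""
    return lincomb(c, a, 0, a)

def commutator(a, b):
    """Return [a, b] = ab - ba."""
    return lincomb(1, matmul(a, b), -1, matmul(b, a))

def is_close(a, b, tol=1e-12):
    n = len(a)
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(n) for j in range(n))

# Spin-1/2 representation: J = (hbar/2) * (Pauli matrices).
JX = [[0, HBAR / 2], [HBAR / 2, 0]]
JY = [[0, -1j * HBAR / 2], [1j * HBAR / 2, 0]]
JZ = [[HBAR / 2, 0], [0, -HBAR / 2]]
J2 = lincomb(1, lincomb(1, matmul(JX, JX), 1, matmul(JY, JY)), 1, matmul(JZ, JZ))
ZERO = [[0, 0], [0, 0]]

print(is_close(commutator(JX, JY), scaled(1j * HBAR, JZ)))  # [Jx, Jy] = i hbar Jz
print(is_close(commutator(J2, JZ), ZERO))                   # [J^2, Jz] = 0
```

In this representation $J^2$ comes out as $(3/4)\hbar^2$ times the identity, so it commutes with every component.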

To perform this calculation, we will make use of a special set of operators called
ladder operators. This will allow us to find the allowed eigenvalues of $J^2$ and $J_z$
without even calculating what the wave function looks like (although we will, in
fact, find explicit forms for the orbital angular momentum eigenfunctions in the
next section). The ladder operators are designated $J_+$ and $J_-$, and they are defined
by

$$\begin{aligned}
J_+ &= J_x + iJ_y \\
J_- &= J_x - iJ_y
\end{aligned}$$

These operators, $J_+$ and $J_-$, are also called the raising and lowering operators,
respectively. Note that $J_+$ and $J_-$ are not Hermitian operators, e.g.,
$J_+^\dagger = J_x - iJ_y = J_-$. Hence, these two operators do not correspond to observable quantities.
Rather, they can be used to turn one eigenfunction of $J^2$ and $J_z$ into another
eigenfunction with a different set of eigenvalues.

Suppose, for instance, that $\psi$ is an eigenfunction of $J^2$ and $J_z$ with eigenvalues
$\alpha$, $\beta$, so that

$$\begin{aligned}
J^2\psi &= \alpha\psi \\
J_z\psi &= \beta\psi
\end{aligned}$$

and suppose that $J_+$ operates on this eigenfunction to give some new wave function
$\phi$:

$$J_+\psi = \phi$$

It is now possible to find $J^2\phi$ and $J_z\phi$. Note that $J^2$ commutes with $J_x$ and $J_y$
individually, so it also commutes with $J_+$. Therefore,

$$J^2\phi = J^2(J_+\psi) = J_+(J^2\psi) = \alpha J_+\psi = \alpha\phi$$

so $\phi$ is also an eigenfunction of $J^2$ with the same eigenvalue as $\psi$. On the other
hand, $J_+$ does not commute with $J_z$. Instead, we have

$$\begin{aligned}
[J_z, J_+] &= [J_z, J_x + iJ_y] \\
&= [J_z, J_x] + i[J_z, J_y] \\
&= i\hbar J_y + \hbar J_x \\
&= \hbar J_+
\end{aligned}$$

This means that

$$\begin{aligned}
J_z\phi &= J_z(J_+\psi) \\
&= J_+J_z\psi + \hbar J_+\psi \\
&= \beta J_+\psi + \hbar J_+\psi \\
&= (\beta + \hbar)\phi
\end{aligned}$$

To summarize, $J_+$ transforms the wave function $\psi$ with eigenvalues $\alpha$ and $\beta$ (for
operators $J^2$ and $J_z$) into a new wave function with eigenvalues $\alpha$ and $\beta + \hbar$. So
the eigenvalue of $J^2$ is unchanged, but the eigenvalue of $J_z$ is increased by $\hbar$. If we
apply $J_+$ to our new function $\phi$, we will get yet another eigenfunction of $J^2$ and
$J_z$ with the same eigenvalue of $J^2$ but a new eigenvalue for $J_z$, namely $\beta + 2\hbar$.
We can continue this process, increasing the eigenvalue of $J_z$ at each step; hence
the name ladder operator. (As one might have guessed, $J_-$ has the opposite effect:
it keeps the eigenvalue of $J^2$ fixed, but lowers the eigenvalue of $J_z$ by $\hbar$.)
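The action of the raising operator can be illustrated with explicit matrices. The sketch below assumes the standard 3 x 3 matrices for $j = 1$ in the basis where $J_z$ is diagonal; these are not derived in the text and serve only as an example. The names `apply`, `j_plus`, `jz_eigenvalue`, and `norm` are hypothetical, and $\hbar = 1$.

```python
import math

HBAR = 1.0                  # units with hbar = 1 (an assumption of this sketch)
S = HBAR / math.sqrt(2)

# Standard j = 1 matrices, basis ordered as m = +1, 0, -1 (assumed, not derived here).
JX = [[0, S, 0], [S, 0, S], [0, S, 0]]
JY = [[0, -1j * S, 0], [1j * S, 0, -1j * S], [0, 1j * S, 0]]
JZ = [[HBAR, 0, 0], [0, 0, 0], [0, 0, -HBAR]]

def apply(m, v):
    """Matrix times column vector."""
    return [sum(m[i][k] * v[k] for k in range(len(v))) for i in range(len(m))]

def j_plus(v):
    """Apply J+ = Jx + i Jy to the vector v."""
    return [a + 1j * b for a, b in zip(apply(JX, v), apply(JY, v))]

def norm(v):
    return math.sqrt(sum(abs(c) ** 2 for c in v))

def jz_eigenvalue(v, tol=1e-12):
    """Return beta if JZ v = beta v for a nonzero vector v, else None."""
    w = apply(JZ, v)
    k = max(range(len(v)), key=lambda i: abs(v[i]))
    beta = w[k] / v[k]
    if all(abs(w[i] - beta * v[i]) < tol for i in range(len(v))):
        return complex(beta).real
    return None

# Start on the lowest rung (m = -1) and climb until J+ returns the zero vector.
v = [0, 0, 1]
betas = []
while norm(v) > 1e-9:
    betas.append(jz_eigenvalue(v))
    v = j_plus(v)

print(betas)
```

Each application of `j_plus` raises the $J_z$ eigenvalue by one unit of $\hbar$. In this finite example the climb stops after three rungs, because the raising operator sends the top state to the zero vector.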

It would appear that we could continue this process indefinitely, obtaining an
infinite number of different eigenfunctions for $J_z$. However, this is not the case; it
turns out that there is an upper limit on the possible eigenvalues of $J_z$. To see this,
assume once more that $\psi$ is a normalized eigenfunction of both $J^2$ and $J_z$ with
eigenvalues $\alpha$ and $\beta$, respectively. Since $J^2 = J_x^2 + J_y^2 + J_z^2$, we can write

$$(\psi, J^2\psi) - (\psi, J_z^2\psi) = (\psi, J_x^2\psi) + (\psi, J_y^2\psi)$$

Now the left-hand side is $\alpha(\psi, \psi) - \beta^2(\psi, \psi) = \alpha - \beta^2$, while the right-hand
side is (since $J_x$ and $J_y$ are both Hermitian) just $(J_x\psi, J_x\psi) + (J_y\psi, J_y\psi)$. This
latter quantity is nonnegative from Equation (5.14). Hence, we have

$$\alpha - \beta^2 \geq 0$$

so

$$\beta^2 \leq \alpha$$

Note that there is a classical analogy to this result; the z component of a vector
cannot be larger than the vector itself! Therefore, classically, $J_z^2 \leq J^2$.

How do we reconcile this upper bound on $\beta$ with the fact that we can repeatedly
apply $J_+$ to $\psi$ and increase the value of $\beta$ each time? This is possible only if there
is an eigenfunction of $J^2$ and $J_z$, which we will call $\psi_{max}$, for which

$$J_+\psi_{max} = 0$$

Thus, as we apply $J_+$ repeatedly, we get larger and larger values for $\beta$, but the
process terminates when we reach $\psi_{max}$. A similar argument shows that there
must be an eigenfunction $\psi_{min}$ for which

$$J_-\psi_{min} = 0$$

We can use these two eigenfunctions to derive the possible eigenvalues of $J^2$ and
$J_z$. From the definitions of $J_+$ and $J_-$, it is tedious but straightforward to show that

$$J^2 = J_-J_+ + J_z^2 + \hbar J_z \tag{6.14}$$

$$J^2 = J_+J_- + J_z^2 - \hbar J_z \tag{6.15}$$

Using Equation (6.14), we see that

$$\begin{aligned}
J^2\psi_{max} &= J_-J_+\psi_{max} + J_z^2\psi_{max} + \hbar J_z\psi_{max} \\
\alpha\psi_{max} &= 0 + \beta_{max}^2\psi_{max} + \hbar\beta_{max}\psi_{max}
\end{aligned}$$



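where $\beta_{max}$ is the eigenvalue of $J_z$ for the eigenstate $\psi_{max}$. Thus,

$$\alpha = \beta_{max}^2 + \hbar\beta_{max} \tag{6.16}$$

Similarly, applying $J^2$ to the state $\psi_{min}$ and using Equation (6.15), we get

$$\alpha = \beta_{min}^2 - \hbar\beta_{min} \tag{6.17}$$

Subtracting Equation (6.17) from (6.16) gives

$$\beta_{max}^2 - \beta_{min}^2 + \hbar(\beta_{max} + \beta_{min}) = 0$$

The left-hand side factors as $(\beta_{max} + \beta_{min})(\beta_{max} - \beta_{min} + \hbar)$, and the second
factor cannot vanish because $\beta_{max} \geq \beta_{min}$. The equation therefore has the solution

$$\beta_{min} = -\beta_{max} \tag{6.18}$$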

But recall that we can apply $J_+$ repeatedly to $\psi$ and with each application,
increase the eigenvalue of $J_z$ by $\hbar$. Hence, all of the eigenvalues of $J_z$ must differ
from each other by an integer multiple of $\hbar$, so

$$\beta_{max} = \beta_{min} + n\hbar \tag{6.19}$$

where $n$ is a nonnegative integer. Combining Equations (6.18) and (6.19) gives

$$\beta_{max} - (-\beta_{max}) = n\hbar$$

so that

$$\beta_{max} = \frac{n\hbar}{2} \tag{6.20}$$

Define a new number $j$ given by $j = n/2$, so that $j$ is an integer or a half-integer:

$$j = 0, \tfrac{1}{2}, 1, \tfrac{3}{2}, 2, \ldots \tag{6.21}$$

Then from Equations (6.16) and (6.20), we have

$$\alpha = j^2\hbar^2 + j\hbar^2 = \hbar^2 j(j + 1)$$

with the possible values of $j$ given by Equation (6.21).

We also know that $\beta$ must go in integer steps of $\hbar$ from $-j\hbar$ to $+j\hbar$, so

$$\beta = -j\hbar,\; -j\hbar + \hbar,\; \ldots,\; (j - 1)\hbar,\; j\hbar$$

A convenient way to write this is

$$\beta = m_j\hbar, \qquad m_j = -j,\; -j + 1,\; \ldots,\; j - 1,\; j$$




To summarize, then, if $\psi$ is an eigenfunction of both $J^2$ and $J_z$, then its possible
eigenvalues for $J^2$ are $\hbar^2 j(j + 1)$, i.e.,

$$J^2\psi = \hbar^2 j(j + 1)\psi \tag{6.22}$$

where $j$ is an integer or a half-integer,

$$j = 0, \tfrac{1}{2}, 1, \tfrac{3}{2}, 2, \ldots \tag{6.23}$$

and the eigenvalues of $J_z$ depend on the value of $j$, namely,

$$J_z\psi = m_j\hbar\psi \tag{6.24}$$

where

$$m_j = -j,\; -j + 1,\; \ldots,\; j - 1,\; j \tag{6.25}$$
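As a small illustration of Equations (6.22) through (6.25), the allowed values can be listed exactly with rational arithmetic. The function names `allowed_m` and `j_squared_eigenvalue` are hypothetical, and eigenvalues are reported in units of $\hbar$ and $\hbar^2$.

```python
from fractions import Fraction

def allowed_m(j):
    """Allowed m_j values (in units of hbar) for a given j, per Equation (6.25)."""
    j = Fraction(j)
    if j < 0 or (2 * j).denominator != 1:
        raise ValueError("j must be a nonnegative integer or half-integer")
    return [-j + k for k in range(int(2 * j) + 1)]

def j_squared_eigenvalue(j):
    """Eigenvalue of J^2 in units of hbar^2, per Equation (6.22)."""
    j = Fraction(j)
    return j * (j + 1)

for j in (Fraction(0), Fraction(1, 2), Fraction(1), Fraction(3, 2)):
    print(j, j_squared_eigenvalue(j), [str(m) for m in allowed_m(j)])
```

Each $j$ gives $2j + 1$ values of $m_j$, spaced by one unit and symmetric about zero.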


Here is another example of quantization: the total squared angular momentum 
and the z component cannot have arbitrary values. Instead, they are restricted 
to the discrete values given in Equations (6.22)—(6.25). In fact, the quantization 
of angular momentum differs from energy quantization in an important respect: 
although a given potential will have a lowest-energy state, there is no absolute 
lower bound on the energy that can exist in nature: the potential can always be 
altered to produce a lower-energy ground state. With angular momentum, on the 
other hand, there is an absolute lowest angular momentum state. For example, the 
z component of angular momentum, if it is nonzero, cannot be smaller than $\hbar/2$;
this represents the smallest “unit” of angular momentum found in nature. 


6.3 ■ THE SCHRODINGER EQUATION IN SPHERICAL COORDINATES

In quantum mechanics, the case of spherical symmetry is often encountered. The 
most obvious example of this is the potential experienced by the electron in a 
hydrogen atom, given by 


$$V(r) = -\frac{1}{4\pi\epsilon_0}\frac{e^2}{r}$$


In this case the potential depends only on the radial distance r; such potentials are 
called central potentials. For central potentials, the spherical coordinate system is 
the most convenient. 

FIGURE 6.3 The spherical coordinate system expresses all positions as a function of $r$,
$\theta$, and $\phi$.

The spherical coordinate system is described by the coordinates $r$, $\theta$, and $\phi$,
where $r$ is the distance from the origin, $\theta$ is the angle relative to the z-axis, and $\phi$
gives the azimuthal angle relative to the x-axis (Figure 6.3). Spherical coordinates
are related to the familiar rectangular coordinates by

$$x = r\sin\theta\cos\phi \tag{6.26}$$

$$y = r\sin\theta\sin\phi \tag{6.27}$$

$$z = r\cos\theta \tag{6.28}$$


Using these relations, any operator in rectangular coordinates can be expressed in 
terms of spherical coordinates, and vice versa. 
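Equations (6.26) through (6.28) and their inverse can be written as a pair of helper functions. This is a minimal sketch; the names `spherical_to_rect` and `rect_to_spherical` are hypothetical, and the inverse assumes the conventions $0 \leq \theta \leq \pi$ and $0 \leq \phi < 2\pi$.

```python
import math

def spherical_to_rect(r, theta, phi):
    """Equations (6.26)-(6.28): theta from the z-axis, phi from the x-axis."""
    x = r * math.sin(theta) * math.cos(phi)
    y = r * math.sin(theta) * math.sin(phi)
    z = r * math.cos(theta)
    return x, y, z

def rect_to_spherical(x, y, z):
    """Inverse map, with theta in [0, pi] and phi in [0, 2*pi)."""
    r = math.sqrt(x * x + y * y + z * z)
    theta = math.acos(z / r) if r > 0 else 0.0
    phi = math.atan2(y, x) % (2 * math.pi)
    return r, theta, phi

point = spherical_to_rect(2.0, 0.7, 4.0)
print(point)
print(rect_to_spherical(*point))
```

Converting to rectangular coordinates and back returns the original $(r, \theta, \phi)$ up to rounding error, as long as the angles lie in the stated ranges.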


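Example 6.2. Expressing the Operator $\partial/\partial\phi$ in Rectangular Coordinates.

To transform from spherical coordinates to rectangular coordinates, we use

$$\frac{\partial}{\partial\phi} = \frac{\partial x}{\partial\phi}\frac{\partial}{\partial x} + \frac{\partial y}{\partial\phi}\frac{\partial}{\partial y} + \frac{\partial z}{\partial\phi}\frac{\partial}{\partial z}$$

Substituting Equations (6.26)-(6.28) for the derivatives gives

$$\begin{aligned}
\frac{\partial}{\partial\phi} &= -r\sin\theta\sin\phi\,\frac{\partial}{\partial x} + r\sin\theta\cos\phi\,\frac{\partial}{\partial y} + 0 \\
&= -y\frac{\partial}{\partial x} + x\frac{\partial}{\partial y}
\end{aligned}$$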


Now we can express the angular momentum operators in spherical coordinates.
The result of Example 6.2 can be used to find $L_z$:

$$-i\hbar\frac{\partial}{\partial\phi} = -YP_x + XP_y$$

which gives the desired expression for $L_z$:

$$L_z = -i\hbar\frac{\partial}{\partial\phi} \tag{6.29}$$




Note the resemblance to the one-dimensional linear momentum operator, $P_x =
-i\hbar(\partial/\partial x)$. In both cases the operator is based on the derivative in the direction of
the classical motion. Using similar methods (but much more algebra!) the operator
$L^2$ can also be derived in spherical coordinates:

$$L^2 = -\hbar^2\left[\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\phi^2}\right] \tag{6.30}$$


These operators can be used to calculate explicitly the eigenfunctions and eigenvalues
of $L^2$ and $L_z$.

Consider first the eigenvalues of $L_z$. From the previous section, we know that
such eigenvalues must be of the form $m\hbar$ with $m$ given by an integer or half-integer.
However, this represents the set of all possible eigenvalues for a general angular
momentum operator; we are dealing with a special case (orbital angular momentum)
characterized by a specific angular momentum operator, so it is possible that
some of these eigenvalues are excluded.

Assume that $\psi(r, \theta, \phi)$ is an eigenfunction of $L_z$ with eigenvalue $m_l\hbar$:

$$L_z\psi = m_l\hbar\psi$$

where the $l$ subscript on $m$ indicates that we are dealing with orbital angular
momentum. Once again, we use separation of variables and assume that the solution
is of the form

$$\psi(r, \theta, \phi) = R(r)F(\theta)G(\phi)$$

Using this form for $\psi$, along with the expression for $L_z$ from Equation (6.29), our
eigenvalue equation becomes

$$\begin{aligned}
-i\hbar R(r)F(\theta)\frac{dG}{d\phi} &= m_l\hbar R(r)F(\theta)G(\phi) \\
-i\hbar\frac{dG}{d\phi} &= m_l\hbar G
\end{aligned}$$

which has the solution

$$G(\phi) = e^{im_l\phi}$$

so that

$$\psi(r, \theta, \phi) = R(r)F(\theta)e^{im_l\phi} \tag{6.31}$$

Equation (6.31) gives the general $\phi$-dependence for any wave function that is an
eigenfunction of $L_z$.

Now note that we must impose an additional condition on $\psi$. In spherical
coordinates, increasing $\phi$ by $2\pi$ just amounts to a 360 degree rotation, so it brings
any system back to the same position in physical space. Hence, it must be true that

$$\psi(r, \theta, \phi + 2\pi) = \psi(r, \theta, \phi)$$

so Equation (6.31) gives

$$e^{im_l(\phi + 2\pi)} = e^{im_l\phi}$$

which implies

$$e^{2\pi im_l} = 1$$

This equation is satisfied if and only if $m_l$ is an integer (positive, negative, or zero):

$$m_l = 0, \pm 1, \pm 2, \pm 3, \ldots$$

This is obviously more restrictive than the general condition on angular momentum
quantum numbers, for which $m$ could be an integer or a half-integer. Since $m_l$
ranges from $-l$ to $+l$ in integer steps, the only way to ensure that $m_l$ is an integer
is for $l$ to be an integer as well:

$$l = 0, 1, 2, \ldots$$

This, then, is a distinguishing property of orbital angular momentum: while the
eigenvalues of $L^2$ are $\hbar^2 l(l + 1)$ and the eigenvalues of $L_z$ are $\hbar m_l$, just as in the
case of general angular momentum, $l$ must be an integer, which forces $m_l$ to be an
integer as well:

$$m_l = -l,\; -l + 1,\; \ldots,\; -1,\; 0,\; 1,\; \ldots,\; l$$

Half-integer values for $l$ and $m_l$ are excluded.
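The single-valuedness condition $e^{2\pi i m_l} = 1$ is easy to test numerically. The sketch below is only a check of that one condition; the function name `is_single_valued` and the tolerance are hypothetical choices.

```python
import cmath
import math

def is_single_valued(m, tol=1e-12):
    """True if exp(2*pi*i*m) equals 1 to within tol, i.e. exp(i*m*phi) has period 2*pi."""
    return abs(cmath.exp(2j * math.pi * m) - 1) < tol

for m in (0, 1, -3, 0.5, 1.5):
    print(m, is_single_valued(m))
```

Integer values of $m$ pass the test, while half-integer values give $e^{2\pi i m} = -1$ and fail it.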

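Now consider the Schrodinger equation in spherical coordinates for some arbitrary
potential $V(r, \theta, \phi)$. The Hamiltonian still has the familiar form

$$H = -\frac{\hbar^2}{2m}\nabla^2 + V(r, \theta, \phi)$$

but now $\nabla^2$ must be expressed in terms of the spherical coordinates $r$, $\theta$, and $\phi$.
This transformation is tedious but straightforward; it yields

$$H = -\frac{\hbar^2}{2m}\left[\frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial}{\partial r}\right) + \frac{1}{r^2\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right) + \frac{1}{r^2\sin^2\theta}\frac{\partial^2}{\partial\phi^2}\right] + V(r, \theta, \phi)$$

But now compare the second and third terms in the brackets with the expression
for $L^2$ in spherical coordinates (Equation 6.30). It is clear that $H$ can be rewritten
in the form

$$H = -\frac{\hbar^2}{2m}\frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial}{\partial r}\right) + \frac{L^2}{2mr^2} + V(r, \theta, \phi) \tag{6.32}$$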





FIGURE 6.4 The potential $V(r)$ corresponds to a classical central force. The direction
of the force is always radial (toward or away from the origin), and it produces no torque.

Note the similarity between this expression and the classical expression for the
energy of a body moving in a central potential $V(r)$:

$$E = \frac{1}{2}m\left(\frac{dr}{dt}\right)^2 + \frac{L^2}{2mr^2} + V(r)$$

In the classical central force problem, the term $L^2/2mr^2$ leads to a fictitious “centrifugal
force”, producing a minimum in the effective potential for some choices
of $V(r)$.
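The minimum in the effective potential can be located numerically for one specific choice of $V(r)$. The sketch below assumes an attractive potential $V(r) = -k/r$ and arbitrary units with $L = m = k = 1$; for this choice the minimum lies at $r = L^2/(mk)$. The names `effective_potential` and `argmin_unimodal` are hypothetical, and the search is a simple ternary search that relies on the function having a single minimum in the interval.

```python
def effective_potential(r, ang_mom, mass, k):
    """Centrifugal term L^2/(2 m r^2) plus an assumed attractive potential -k/r."""
    return ang_mom ** 2 / (2 * mass * r ** 2) - k / r

def argmin_unimodal(f, lo, hi, iterations=200):
    """Ternary search for the minimum of a function with a single minimum on [lo, hi]."""
    for _ in range(iterations):
        a = lo + (hi - lo) / 3
        b = hi - (hi - lo) / 3
        if f(a) < f(b):
            hi = b
        else:
            lo = a
    return (lo + hi) / 2

# Arbitrary units: L = m = k = 1, so the minimum is expected near r = 1.
r_min = argmin_unimodal(lambda r: effective_potential(r, 1.0, 1.0, 1.0), 0.1, 10.0)
print(r_min, effective_potential(r_min, 1.0, 1.0, 1.0))
```

At small $r$ the centrifugal term dominates and the effective potential rises steeply, while at large $r$ the attractive term dominates; the minimum sits between these two regimes.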

Now we would like to find solutions of the Schrodinger equation which are
also eigenfunctions of $L^2$ and $L_z$, i.e., they are states of definite total angular
momentum and of the z component of angular momentum. In order for this to be
possible, the Hamiltonian in Equation (6.32) must commute with $L^2$ and $L_z$. Since
$L^2$ and $L_z$ are functions only of $\theta$, $\phi$, and the derivatives with respect to $\theta$ and $\phi$,
they commute with the first term in Equation (6.32), which is a function only of
$r$ and derivatives with respect to $r$. Further, $L^2$ and $L_z$ commute with the second
term, since they commute with both $L^2$ and $1/r^2$. Thus, the question of whether or
not $L^2$ and $L_z$ commute with $H$ boils down to whether or not they commute with
$V(r, \theta, \phi)$. A very simple way to ensure that $L^2$ and $L_z$ do, in fact, commute with
$V$ is to take $V$ to be a function only of $r$ and to be independent of $\theta$ and $\phi$, since, as
we have noted, $L^2$ and $L_z$ are functions only of $\theta$, $\phi$, and the derivatives with respect
to $\theta$ and $\phi$. Classically, a potential of the form $V(r)$ corresponds to a central force,
i.e., one that is directed radially inward or radially outward (Figure 6.4). Such a
force produces no torque, so that angular momentum is conserved. We will assume
for the remainder of this chapter that we are dealing with a potential of the form
$V = V(r)$. This will be the case, for example, for an electron in a hydrogen atom.

The full Schrodinger equation, $H\psi = E\psi$, then becomes

$$-\frac{\hbar^2}{2m}\frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial\psi}{\partial r}\right) + \frac{L^2}{2mr^2}\psi + V(r)\psi = E\psi$$


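Now assume that $\psi$ is an eigenfunction of $L^2$ with eigenvalue $\hbar^2 l(l + 1)$, so that
this equation becomes

$$-\frac{\hbar^2}{2mr}\frac{\partial^2}{\partial r^2}(r\psi) + \frac{\hbar^2 l(l + 1)}{2mr^2}\psi + V(r)\psi = E\psi \tag{6.33}$$

where we have also used the fact that

$$\frac{1}{r}\frac{\partial^2}{\partial r^2}(r\psi) = \frac{2}{r}\frac{\partial\psi}{\partial r} + \frac{\partial^2\psi}{\partial r^2} = \frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial\psi}{\partial r}\right)$$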

To solve this equation, we once again use separation of variables, assuming a
solution of the form

$$\psi(r, \theta, \phi) = R(r)Y(\theta, \phi)$$

Substituting this solution into Equation (6.33) gives

$$-\frac{\hbar^2}{2mr}\frac{\partial^2}{\partial r^2}(rR(r))\,Y(\theta, \phi) + \frac{\hbar^2 l(l + 1)}{2mr^2}R(r)Y(\theta, \phi) + V(r)R(r)Y(\theta, \phi) = ER(r)Y(\theta, \phi)$$

The function $Y(\theta, \phi)$ can be divided out, yielding an equation for $R(r)$:

$$-\frac{\hbar^2}{2mr}\frac{\partial^2}{\partial r^2}(rR(r)) + \frac{\hbar^2 l(l + 1)}{2mr^2}R(r) + V(r)R(r) = ER(r) \tag{6.34}$$

Equation (6.34) is called the radial Schrodinger equation. It gives the energy and
the radial part of the wave function $R(r)$ for an arbitrary central potential $V(r)$. Of
course, both the energy and $R(r)$ will depend on the particular form of $V(r)$. Note
also that the solutions for $R(r)$ and $E$ will, in general, depend on $l$, but they are
completely independent of $m_l$, since $m_l$ does not appear in Equation (6.34). The
physical reason for this is that for a fixed $l$, a change in the eigenvalue of $L_z$ can
be produced simply by rotating the coordinate axes, and the energy of the system
should not depend on the choice of the coordinate system.

It appears that the function $Y(\theta, \phi)$ has vanished from the problem completely.
In fact, the functional form for $Y(\theta, \phi)$ can be derived from the eigenvalue equations,

$$\begin{aligned}
L^2\psi &= \hbar^2 l(l + 1)\psi \\
L_z\psi &= \hbar m_l\psi
\end{aligned}$$

Substituting the explicit forms for $L^2$ and $L_z$ from Equations (6.30) and (6.29)
gives

$$-\hbar^2\left[\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\phi^2}\right]R(r)Y(\theta, \phi) = \hbar^2 l(l + 1)R(r)Y(\theta, \phi)$$

$$-i\hbar\frac{\partial}{\partial\phi}R(r)Y(\theta, \phi) = \hbar m_l R(r)Y(\theta, \phi)$$




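Dividing out the factor $R(r)$ gives two equations for $Y(\theta, \phi)$:

$$-\hbar^2\left[\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\phi^2}\right]Y(\theta, \phi) = \hbar^2 l(l + 1)Y(\theta, \phi) \tag{6.35}$$

$$-i\hbar\frac{\partial}{\partial\phi}Y(\theta, \phi) = \hbar m_l Y(\theta, \phi) \tag{6.36}$$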


An important point here is that $Y(\theta, \phi)$ is determined entirely by the eigenvalues
$l$ and $m_l$ and is completely independent of $V(r)$. Therefore, it is customary to
write these functions as $Y_l^m(\theta, \phi)$. (For clarity, we write $Y_l^m$ rather than $Y_l^{m_l}$; it is
understood that the $m$ appearing in $Y_l^m$ always refers to orbital angular momentum.)
These functions, which we will now calculate, provide a universal description of
the angular part of the wave function for all central potentials. Once again we
assume separation of variables, so that $Y_l^m(\theta, \phi) = F(\theta)G(\phi)$, and substitute this
form for $Y_l^m$ into Equations (6.35) and (6.36). Note that we have already solved
Equation (6.36) above with the result that

$$G(\phi) = e^{im_l\phi}$$


Now we need to determine the $\theta$-dependence of $Y_l^m(\theta, \phi)$. Recall that the largest
possible value of $m_l$ for a given $l$ is $m_l = l$. Thus, if an expression for $Y_l^l(\theta, \phi)$
is known, it is possible to derive all of the other values of $Y_l^m(\theta, \phi)$ by repeated
application of the lowering operator $L_-$. In terms of spherical coordinates, the
raising and lowering operators are given by

$$L_+ = \hbar e^{i\phi}\left(\frac{\partial}{\partial\theta} + i\cot\theta\frac{\partial}{\partial\phi}\right) \tag{6.37}$$

$$L_- = \hbar e^{-i\phi}\left(-\frac{\partial}{\partial\theta} + i\cot\theta\frac{\partial}{\partial\phi}\right) \tag{6.38}$$

In order to derive $Y_l^l(\theta, \phi)$, recall that the raising operator applied to the highest-$m_l$
state gives 0:

$$L_+Y_l^l(\theta, \phi) = 0$$

In terms of $L_+$ in spherical coordinates (Equation 6.37), this equation becomes

$$\hbar e^{i\phi}\left(\frac{\partial Y_l^l}{\partial\theta} + i\cot\theta\frac{\partial Y_l^l}{\partial\phi}\right) = 0$$

which has the solution

$$Y_l^l(\theta, \phi) = (\sin\theta)^l e^{il\phi}$$

Now, by repeatedly applying the operator $L_-$ from Equation (6.38), one can obtain
all of the other functions $Y_l^m(\theta, \phi)$.




TABLE 6.1 The first few normalized spherical harmonics, $Y_l^m(\theta, \phi)$.

l    m_l    Y_l^m(θ, φ)
0     0     $(1/4\pi)^{1/2}$
1    -1     $(3/8\pi)^{1/2}\sin\theta\,e^{-i\phi}$
1     0     $(3/4\pi)^{1/2}\cos\theta$
1    +1     $-(3/8\pi)^{1/2}\sin\theta\,e^{i\phi}$
2    -2     $(15/32\pi)^{1/2}\sin^2\theta\,e^{-2i\phi}$
2    -1     $(15/8\pi)^{1/2}\sin\theta\cos\theta\,e^{-i\phi}$
2     0     $(5/16\pi)^{1/2}(3\cos^2\theta - 1)$
2    +1     $-(15/8\pi)^{1/2}\sin\theta\cos\theta\,e^{i\phi}$
2    +2     $(15/32\pi)^{1/2}\sin^2\theta\,e^{2i\phi}$

These $Y_l^m$ functions arise in a variety of different areas of physics; it is difficult
to overemphasize their importance. They are called spherical harmonics, or more
colloquially, “Y-l-m’s.” A list of the functions $Y_l^m(\theta, \phi)$ for the first few values
of $l$ is given in Table 6.1. The constants appearing in the $Y_l^m$ functions in Table 6.1
are chosen so as to normalize these functions, i.e.,

$$\int_{\theta=0}^{\pi}\int_{\phi=0}^{2\pi}|Y_l^m(\theta, \phi)|^2\sin\theta\,d\theta\,d\phi = 1$$

This still leaves an arbitrary complex phase factor; the convention is to choose
this phase so that $Y_l^{-m}(\theta, \phi) = (-1)^m\,Y_l^{m*}(\theta, \phi)$. This accounts for the minus signs
appearing in some of the $Y_l^m$ functions in Table 6.1.
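The normalization and the phase convention can be checked numerically. The sketch below hard-codes the $l \leq 2$ spherical harmonics in their standard form (matching Table 6.1) and integrates with a simple midpoint rule; the dictionary `SPHERICAL_HARMONICS`, the function `inner_product`, and the grid sizes are hypothetical choices made for illustration.

```python
import cmath
import math

PI = math.pi

# Standard l <= 2 spherical harmonics, keyed by (l, m); arguments are (theta, phi).
SPHERICAL_HARMONICS = {
    (0, 0): lambda t, p: complex(math.sqrt(1 / (4 * PI))),
    (1, -1): lambda t, p: math.sqrt(3 / (8 * PI)) * math.sin(t) * cmath.exp(-1j * p),
    (1, 0): lambda t, p: complex(math.sqrt(3 / (4 * PI)) * math.cos(t)),
    (1, 1): lambda t, p: -math.sqrt(3 / (8 * PI)) * math.sin(t) * cmath.exp(1j * p),
    (2, -2): lambda t, p: math.sqrt(15 / (32 * PI)) * math.sin(t) ** 2 * cmath.exp(-2j * p),
    (2, -1): lambda t, p: math.sqrt(15 / (8 * PI)) * math.sin(t) * math.cos(t) * cmath.exp(-1j * p),
    (2, 0): lambda t, p: complex(math.sqrt(5 / (16 * PI)) * (3 * math.cos(t) ** 2 - 1)),
    (2, 1): lambda t, p: -math.sqrt(15 / (8 * PI)) * math.sin(t) * math.cos(t) * cmath.exp(1j * p),
    (2, 2): lambda t, p: math.sqrt(15 / (32 * PI)) * math.sin(t) ** 2 * cmath.exp(2j * p),
}

def inner_product(f, g, n_theta=200, n_phi=64):
    """Midpoint-rule estimate of the integral of conj(f) * g * sin(theta) over the sphere."""
    d_theta = PI / n_theta
    d_phi = 2 * PI / n_phi
    total = 0j
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        weight = math.sin(theta)
        for k in range(n_phi):
            phi = (k + 0.5) * d_phi
            total += f(theta, phi).conjugate() * g(theta, phi) * weight
    return total * d_theta * d_phi

for key, y in SPHERICAL_HARMONICS.items():
    print(key, round(abs(inner_product(y, y)), 4))
```

Each norm comes out close to 1, with the small deviation set by the grid spacing. The same `inner_product` can be applied to two different entries to check that they are orthogonal.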

As derived earlier, all of the spherical harmonics in Table 6.1 are of the form
$Y_l^m(\theta, \phi) = f(\theta)e^{im\phi}$. This implies that $Y_l^0$ is a function only of $\theta$ and is independent
of $\phi$. These $m = 0$ functions, $Y_l^0(\theta)$, are encountered in the theory of electromagnetic
fields, where they arise as the solution of Laplace’s equation, which
gives the electric potential in a vacuum. Up to a normalization constant, they are the
Legendre polynomials, which are generally written as $P_l(\cos\theta)$. Then in terms of the
spherical harmonics, $P_l(\cos\theta) \propto Y_l^0(\theta)$. Note further that the $l = 0$, $m = 0$ spherical
harmonic is spherically symmetric, i.e., completely independent of both $\theta$ and $\phi$, and it is the only
spherical harmonic with this property.

The spherical harmonics are difficult to visualize, since they are complex functions
of spherical angular coordinates. However, note that $|Y_l^m|^2$, which gives the
angular dependence of the probability density, is real and is independent of $\phi$ (since
$|e^{im\phi}|^2 = 1$). In Figure 6.5, we show $|Y_l^m|^2$ as a function of $\theta$ for $l = 0, 1, 2$. These
are polar graphs in which $\theta$ is the angle relative to the z-axis, and $|Y_l^m|^2$ is plotted
as the radial distance from the origin.

We have now achieved a remarkably general result for central potentials. For
any wave function that represents an eigenstate of $L^2$ and $L_z$ in a central potential
$V(r)$, with angular momentum quantum numbers $l$ and $m_l$, the angular part of the
wave function will be given by the appropriate $Y_l^m(\theta, \phi)$, completely independent
of both $V(r)$ and the energy $E$. The potential does determine $E$ and the radial part
of the wave function $R(r)$, so these will depend on the particular choice of $V(r)$.


6.4 ■ THE HYDROGEN ATOM

We are now in a position to determine the wave functions and energy levels of the
electron in the hydrogen atom, one of the most important results in all of quantum
mechanics. Before inserting the Coulomb potential experienced by the electron,
we first simplify the general form for the radial Schrodinger equation (Equation
6.34). Multiplying both sides of Equation (6.34) by $r$ gives

$$-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial r^2}(rR(r)) + \frac{\hbar^2 l(l + 1)}{2mr^2}rR(r) + V(r)rR(r) = ErR(r) \tag{6.39}$$

This form of the equation suggests the substitution $u(r) = rR(r)$, simplifying the
equation to the form

$$-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial r^2}u(r) + \frac{\hbar^2 l(l + 1)}{2mr^2}u(r) + V(r)u(r) = Eu(r)$$

Note again that both the wave function and the energies should depend on the
particular value of $l$, since it appears in this equation. Two boundary conditions
can be imposed on the wave function; as usual, as $r \to \infty$, we require $u \to 0$.
However, an additional boundary condition must be imposed at the origin; since
$\psi \propto u/r$, we must have $u \to 0$ as $r \to 0$ in order to keep $\psi$ finite.

FIGURE 6.5 Polar graphs showing $|Y_l^m|^2$ as a function of $\theta$ for the indicated values of $l$
and $m$.

Now consider the actual hydrogen atom. It consists of an electron with mass

$$m_e = 9.109 \times 10^{-31}\ \text{kg}$$

orbiting a proton with mass

$$m_p = 1.672 \times 10^{-27}\ \text{kg}$$

In a classical system of this sort, it would be inaccurate to take the electron to orbit
a fixed proton; rather, the electron and proton orbit their common center of mass.
Consider, more generally, a classical system consisting of a particle with mass $m_1$
at position $\mathbf{r}_1$, and a particle with mass $m_2$ at position $\mathbf{r}_2$, so the vector separation
between the particles is

$$\mathbf{r} = \mathbf{r}_1 - \mathbf{r}_2 \tag{6.40}$$

Assume that the potential energy $V$ is a function only of the distance between the
particles $r$, where $r = |\mathbf{r}|$. The energy of this system is

$$E = \frac{1}{2}m_1\left(\frac{d\mathbf{r}_1}{dt}\right)^2 + \frac{1}{2}m_2\left(\frac{d\mathbf{r}_2}{dt}\right)^2 + V(r) \tag{6.41}$$

We are free to choose any origin for the coordinate system, so we take it to lie at
the center of mass of the particles, which gives

$$m_1\mathbf{r}_1 + m_2\mathbf{r}_2 = 0$$

This equation and Equation (6.40) allow us to express $\mathbf{r}_1$ and $\mathbf{r}_2$ as functions of $\mathbf{r}$:

$$\begin{aligned}
\mathbf{r}_1 &= \frac{m_2}{m_1 + m_2}\mathbf{r} \\
\mathbf{r}_2 &= -\frac{m_1}{m_1 + m_2}\mathbf{r}
\end{aligned}$$

Substituting these expressions for $\mathbf{r}_1$ and $\mathbf{r}_2$ in Equation (6.41) gives an expression
for $E$ as a function only of the separation between the particles:

$$E = \frac{1}{2}\mu\left(\frac{d\mathbf{r}}{dt}\right)^2 + V(r)$$

where $\mu$ is given by

$$\mu = \frac{m_1m_2}{m_1 + m_2}$$

The quantity $\mu$ is called the reduced mass.

The quantum analog to this result is achieved by replacing $m$ in the Hamiltonian
with $\mu$, so that

$$H = -\frac{\hbar^2}{2\mu}\nabla^2 + V$$

where, for the hydrogen atom, $\mu$ is given by

$$\mu = \frac{m_em_p}{m_e + m_p} \tag{6.42}$$

Note that $m_e \ll m_p$, so that $\mu$ for the hydrogen atom will be very close to $m_e$.
Plugging the actual numbers into Equation (6.42) gives

$$\mu = 0.9995\,m_e$$

Hence, using $\mu$ instead of $m_e$ produces only a small correction in the case of the
hydrogen atom. However, we will include this correction in our calculations.
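The arithmetic behind Equation (6.42) can be reproduced directly from the two masses quoted above. The constant and function names in this sketch are hypothetical.

```python
M_E = 9.109e-31  # electron mass in kg, as quoted in the text
M_P = 1.672e-27  # proton mass in kg, as quoted in the text

def reduced_mass(m1, m2):
    """Reduced mass of a two-body system: m1*m2 / (m1 + m2)."""
    return m1 * m2 / (m1 + m2)

mu = reduced_mass(M_E, M_P)
ratio = mu / M_E
print(f"mu = {mu:.4e} kg, mu/m_e = {ratio:.4f}")
```

The ratio differs from 1 by roughly $m_e/m_p$, which is about five parts in ten thousand.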




FIGURE 6.6 The potential experienced by the electron in a hydrogen atom satisfies
$V(r) < 0$. Hence, any bound state must have $E < 0$.

The Coulomb potential felt by the electron in the electric field of the proton is

$$V(r) = -\frac{1}{4\pi\epsilon_0}\frac{e^2}{r}$$

so the radial Schrodinger equation becomes

$$-\frac{\hbar^2}{2\mu}\frac{\partial^2 u}{\partial r^2} + \frac{\hbar^2 l(l + 1)}{2\mu r^2}u - \frac{1}{4\pi\epsilon_0}\frac{e^2}{r}u = Eu \tag{6.43}$$


We will now solve this equation. Note that the potential is always negative, approaching
zero as $r \to \infty$. Therefore, a bound state (which is the kind of state we are
interested in) must have $E < 0$ (Figure 6.6). To simplify the notation, we take
$\varepsilon = -E$, and rewrite Equation (6.43) as

$$\frac{d^2u}{dr^2} + \frac{2\mu e^2}{4\pi\epsilon_0\hbar^2}\frac{u}{r} - \frac{l(l + 1)u}{r^2} = \frac{2\mu}{\hbar^2}\varepsilon u \tag{6.44}$$

where $\varepsilon > 0$. To solve this equation, first consider its asymptotic behavior in the
limit as $r \to \infty$. In this limit the second and third terms on the left-hand side go
to zero, and we are left with

$$\frac{d^2u}{dr^2} = \frac{2\mu}{\hbar^2}\varepsilon u$$

which has the solution

$$u = e^{-(\sqrt{2\mu\varepsilon}/\hbar)r} \tag{6.45}$$
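As a quick check of Equation (6.45), the exponential can be differentiated twice. The shorthand $\kappa = \sqrt{2\mu\varepsilon}/\hbar$ is introduced here only for convenience and is not notation from the text.

```latex
\begin{aligned}
u(r) &= e^{-\kappa r}, \qquad \kappa = \frac{\sqrt{2\mu\varepsilon}}{\hbar} \\
\frac{du}{dr} &= -\kappa\, e^{-\kappa r} \\
\frac{d^2u}{dr^2} &= \kappa^2 e^{-\kappa r} = \frac{2\mu\varepsilon}{\hbar^2}\, u
\end{aligned}
```

The growing exponential $e^{+\kappa r}$ satisfies the same asymptotic equation, but it violates the boundary condition $u \to 0$ as $r \to \infty$, so only the decaying solution is kept.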

Of course, this is not the exact solution to the full Equation (6.44), but it suggests 
that we investigate solutions of the form 

u(r) = v(r)e~ ( ^ /h)r (6.46) 

where $v(r)$ is now the unknown function to be determined. Substituting this trial
solution into Equation (6.44) gives an equation for $v(r)$:

$$\frac{d^2 v}{dr^2} - \frac{2\sqrt{2\mu\varepsilon}}{\hbar}\frac{dv}{dr} + \frac{2\mu e^2}{4\pi\epsilon_0\hbar^2}\frac{v}{r} - \frac{l(l+1)}{r^2}\,v = 0 \qquad (6.47)$$

To solve for $v(r)$, we now try a series solution of the form

$$v(r) = \sum_{p=0}^{\infty} A_p r^p \qquad (6.48)$$

(Note that the constant term, $A_0$, must be zero because of our previously-derived
boundary condition that $u(0) = 0$.) Substituting this power-series expansion into
Equation (6.47) gives an equation in powers of $r$ on the left-hand side, and in order
for this to be equal to zero, the coefficient multiplying each power of $r$ must vanish.
Enforcing this condition gives the following relation between the $A_p$ coefficients:

$$[p(p+1) - l(l+1)]\,A_{p+1} = \left(\frac{2p\sqrt{2\mu\varepsilon}}{\hbar} - \frac{2\mu e^2}{4\pi\epsilon_0\hbar^2}\right)A_p \qquad (6.49)$$

Because $l$ is the angular momentum quantum number, it has some fixed integer
value for any particular solution. Note that for $p = l$, the left-hand side of the
equation vanishes, so that $A_p$ on the right-hand side must be zero. But taking
$A_p = 0$ on the left-hand side of Equation (6.49) gives $A_{p-1} = 0$ on the right-hand
side. This argument can be repeated to give $A_{p-2} = 0$, $A_{p-3} = 0$, and so on all
of the way down. Thus, the only nonzero coefficients have $p > l$, namely, $A_{l+1}$,
$A_{l+2}, \ldots$ There is, however, one additional constraint on the polynomial series in
Equation (6.48). In order for $u(r)$ to have the correct asymptotic behavior shown
in Equation (6.45), we do not want the $v(r)$ factor in Equation (6.46) to give the
dominant behavior at large $r$. This can be achieved by requiring the polynomial
to have a finite number of terms. (A finite polynomial is always dominated by
an exponential at large $r$; this need not be true for a polynomial with an infinite
number of terms.) In order for the polynomial to terminate after a finite number
of terms, the right-hand side of Equation (6.49) must vanish for some value of $p$,
which we will call $n$. Note that $n$ must be greater than $l$, since only terms with
$p \ge l + 1$ are nonzero. In order for the term on the right-hand side to vanish at
$p = n$, we must have

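$$\frac{2n\sqrt{2\mu\varepsilon}}{\hbar} = \frac{2\mu e^2}{\hbar^2\, 4\pi\epsilon_0}$$

Solving for $E = -\varepsilon$ gives

$$E_n = -\frac{\mu e^4}{2(4\pi\epsilon_0)^2\hbar^2}\,\frac{1}{n^2} \qquad (6.50)$$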







This is exactly the same result derived earlier for the Bohr model of the atom
(Chapter 1)! This equation also gives the physical meaning of the parameter $n$ introduced
above: it determines the energy levels of the hydrogen atom. Furthermore, we have
derived a constraint on $l$: since $n > l$,

$$l \le n - 1$$
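As a numerical check on the energy levels, the sketch below evaluates $E_n = -\mu e^4 / [2(4\pi\epsilon_0)^2\hbar^2 n^2]$, the form that follows from the termination condition above. The physical constants are rounded reference values assumed for this illustration, and the function names are hypothetical.

```python
import math

# Rounded SI reference values, assumed for this sketch.
HBAR = 1.054571817e-34      # J s
E_CHARGE = 1.602176634e-19  # C
EPS0 = 8.8541878128e-12     # F/m
M_E = 9.1093837e-31         # kg
M_P = 1.67262192e-27        # kg


def hydrogen_energy_ev(n: int) -> float:
    """Energy of level n in eV, using the reduced mass mu."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    mu = M_E * M_P / (M_E + M_P)
    joules = -mu * E_CHARGE**4 / (2 * (4 * math.pi * EPS0) ** 2 * HBAR**2 * n**2)
    return joules / E_CHARGE  # convert J to eV


def allowed_l(n: int) -> list[int]:
    """Values of l permitted by the constraint l <= n - 1."""
    return list(range(n))


for n in (1, 2, 3):
    print(n, round(hydrogen_energy_ev(n), 3), allowed_l(n))
```

The ground-state value comes out close to the familiar 13.6 eV binding energy, and each higher level scales as $1/n^2$.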


Now, however, we can move beyond the Bohr model to derive the wave functions
as well. The radial wave function is

$$R(r) = \frac{u(r)}{r} = \frac{v(r)}{r}\,e^{-(\sqrt{2\mu\varepsilon}/\hbar)r}$$

where $v(r)$ is a polynomial in powers of $r$ ranging from $r^{l+1}$ up to $r^n$. Since $R$
depends on $n$ and $l$, we write it as $R_{nl}(r)$. Solving for the power-series coefficients
using Equation (6.49) gives explicit expressions for $R_{nl}(r)$. To simplify these
expressions, we introduce a physical quantity $a_0$ with units of length, given by



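$$a_0 = \frac{4\pi\epsilon_0\hbar^2}{\mu e^2}$$

This length $a_0$ is called the Bohr radius. Then the first few normalized radial wave
functions are

$$R_{10}(r) = 2\left(\frac{1}{a_0}\right)^{3/2} e^{-r/a_0}$$

$$R_{20}(r) = \frac{1}{\sqrt{2}}\left(\frac{1}{a_0}\right)^{3/2}\left(1 - \frac{r}{2a_0}\right) e^{-r/2a_0}$$

$$R_{21}(r) = \frac{1}{2\sqrt{6}}\left(\frac{1}{a_0}\right)^{3/2}\frac{r}{a_0}\, e^{-r/2a_0}$$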
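The polynomial factor $v(r)$ can be generated directly from the recursion relation (6.49). The sketch below assumes $r$ is measured in units of $a_0$, so that $\sqrt{2\mu\varepsilon}/\hbar = 1/n$ and $2\mu e^2/(4\pi\epsilon_0\hbar^2) = 2$ at the allowed energies; the lowest coefficient is set to 1, so the result is not normalized. The function name is illustrative.

```python
from fractions import Fraction


def series_coefficients(n: int, l: int) -> dict[int, Fraction]:
    """Coefficients A_p of v(r) = sum_p A_p r^p, with r in units of a_0.

    Applies [p(p+1) - l(l+1)] A_{p+1} = (2p/n - 2) A_p, starting from
    A_{l+1} = 1 (an arbitrary, unnormalized choice).
    """
    if not 0 <= l < n:
        raise ValueError("need 0 <= l <= n - 1")
    coeffs = {l + 1: Fraction(1)}
    for p in range(l + 1, n):
        lhs = p * (p + 1) - l * (l + 1)   # nonzero because p > l
        rhs = Fraction(2 * p, n) - 2      # vanishes only at p = n
        coeffs[p + 1] = rhs * coeffs[p] / lhs
    return coeffs


for n, l in [(1, 0), (2, 0), (2, 1), (3, 0), (3, 1)]:
    print((n, l), series_coefficients(n, l))
```

Because the right-hand factor vanishes at $p = n$, the series stops at $r^n$. Dividing $v(r)$ by $r$ reproduces, up to normalization, the polynomial factors of the hydrogen radial functions, for example $27 - 18r + 2r^2$ (in units of $a_0$) for $n = 3$, $l = 0$.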



Each radial wave function can be combined with the corresponding $Y_l^{m_l}$ to obtain
the full wave function. Since $R_{nl}(r)$ is a function of $n$ and $l$, and $Y_l^{m_l}(\theta, \phi)$ is a
function of $l$ and $m_l$, the full wave function will be determined by three quantum
numbers: $n$, $l$, and $m_l$:

$$\psi_{nlm_l}(r, \theta, \phi) = R_{nl}(r)\,Y_l^{m_l}(\theta, \phi)$$


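Each of these quantum numbers has a physical significance: $n$, called the principal
quantum number, determines the energy of the electron, $l$ determines the
total squared angular momentum, and $m_l$ gives the $z$ component of the angular
momentum. We have also derived the allowed values for all of these quantum
numbers:

$$n = 1, 2, 3, \ldots$$

$$l \le n - 1$$

$$m_l = -l, -l+1, \ldots, 0, \ldots, l-1, l$$

TABLE 6.2 The normalized hydrogen wave functions for n = 1, 2, 3.

n   l   m_l    $\psi_{nlm_l}(r, \theta, \phi)$

1   0   0      $\psi_{100} = \frac{1}{\sqrt{\pi}}\left(\frac{1}{a_0}\right)^{3/2} e^{-r/a_0}$

2   0   0      $\psi_{200} = \frac{1}{4\sqrt{2\pi}}\left(\frac{1}{a_0}\right)^{3/2}\left(2 - \frac{r}{a_0}\right) e^{-r/2a_0}$

2   1   0      $\psi_{210} = \frac{1}{4\sqrt{2\pi}}\left(\frac{1}{a_0}\right)^{3/2}\left(\frac{r}{a_0}\right) e^{-r/2a_0}\cos\theta$

2   1   ±1     $\psi_{21\pm1} = \frac{1}{8\sqrt{\pi}}\left(\frac{1}{a_0}\right)^{3/2}\left(\frac{r}{a_0}\right) e^{-r/2a_0}\sin\theta\, e^{\pm i\phi}$

3   0   0      $\psi_{300} = \frac{1}{81\sqrt{3\pi}}\left(\frac{1}{a_0}\right)^{3/2}\left(27 - 18\frac{r}{a_0} + 2\frac{r^2}{a_0^2}\right) e^{-r/3a_0}$

3   1   0      $\psi_{310} = \frac{\sqrt{2}}{81\sqrt{\pi}}\left(\frac{1}{a_0}\right)^{3/2}\left(6 - \frac{r}{a_0}\right)\left(\frac{r}{a_0}\right) e^{-r/3a_0}\cos\theta$

3   1   ±1     $\psi_{31\pm1} = \frac{1}{81\sqrt{\pi}}\left(\frac{1}{a_0}\right)^{3/2}\left(6 - \frac{r}{a_0}\right)\left(\frac{r}{a_0}\right) e^{-r/3a_0}\sin\theta\, e^{\pm i\phi}$

3   2   0      $\psi_{320} = \frac{1}{81\sqrt{6\pi}}\left(\frac{1}{a_0}\right)^{3/2}\left(\frac{r^2}{a_0^2}\right) e^{-r/3a_0}\,(3\cos^2\theta - 1)$

3   2   ±1     $\psi_{32\pm1} = \frac{1}{81\sqrt{\pi}}\left(\frac{1}{a_0}\right)^{3/2}\left(\frac{r^2}{a_0^2}\right) e^{-r/3a_0}\sin\theta\cos\theta\, e^{\pm i\phi}$

3   2   ±2     $\psi_{32\pm2} = \frac{1}{162\sqrt{\pi}}\left(\frac{1}{a_0}\right)^{3/2}\left(\frac{r^2}{a_0^2}\right) e^{-r/3a_0}\sin^2\theta\, e^{\pm 2i\phi}$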


The normalized wave functions for $n = 1, 2, 3$ are given in Table 6.2. Note that
the energy $E_n$ is entirely determined by $n$ and is independent of $l$, despite the fact
that the radial Schrodinger equation (Equation 6.34), which determines the energy
levels, contains $l$. This is an “accidental” degeneracy, in the sense that it occurs
only for the Coulomb potential ($V(r) \propto 1/r$). For the case of a general radial
potential, $E$ will depend on $l$.







The total number of states with a given energy $E_n$ is straightforward to compute.
A given value of $n$ corresponds to $n$ possible values of $l$ ($l = 0, 1, \ldots, n - 1$), and each $l$ state has
$2l + 1$ possible values for $m_l$. Thus, the total degeneracy for energy $E_n$ is
$1 + 3 + 5 + \cdots + (2n - 1) = n^2$. We will see in Chapter 8 that each electron also has two
possible spin states corresponding to two possible values of the $z$ component of
the spin angular momentum, so the true degeneracy for a given $n$ is actually $2n^2$.
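The counting argument is easy to confirm by brute-force enumeration. This is a small sketch with illustrative function names, not part of the original derivation.

```python
def orbital_states(n: int) -> list[tuple[int, int, int]]:
    """All (n, l, m_l) combinations allowed for a given n."""
    return [(n, l, m) for l in range(n) for m in range(-l, l + 1)]


def degeneracy(n: int, include_spin: bool = False) -> int:
    """Number of states with energy E_n, optionally counting two spin states."""
    count = len(orbital_states(n))
    return 2 * count if include_spin else count


for n in range(1, 5):
    print(n, degeneracy(n), degeneracy(n, include_spin=True))
```

For each $n$ the enumeration gives $n^2$ orbital states, and $2n^2$ once spin is included.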

Although this model for the hydrogen atom agrees with the Bohr model as far 
as the predictions of the energy levels are concerned, it presents a very different 
physical picture of the behavior of the electrons. In the Bohr model, electrons lie 
in distinct “orbits” around the proton, with the length of each orbit fixed by the 
need for it to comprise an integer number of de Broglie wavelengths. In the picture 
we have just derived (which is the best current picture of the atom), the behavior 
of the electrons is defined entirely by the wave functions. The idea of an electron 
“orbiting” the central proton is essentially abandoned. The electron does not follow 
a well-defined trajectory; instead, one can only talk about the probability of finding 
the electron inside a given volume V, which is given by 


$$P = \int_V |\psi_{nlm_l}(r, \theta, \phi)|^2\, r^2\,dr\,\sin\theta\,d\theta\,d\phi$$

We can also define radial probability densities, $P_{nl}(r)$, by integrating the full
probability density over $\theta$ and $\phi$ but retaining the $r$-dependence:

$$P_{nl}(r)\,dr = \int_{\theta=0}^{\pi}\int_{\phi=0}^{2\pi} |\psi_{nlm_l}(r, \theta, \phi)|^2\, r^2\,dr\,\sin\theta\,d\theta\,d\phi$$

Then $P_{nl}(r)\,dr$ gives the probability of finding the electron in a small interval
$dr$ at a radius $r$ from the proton. A graph of $P_{nl}(r)$ as a function of $r$ is given
in Figure 6.7. Clearly, the larger $n$ states, corresponding to larger energies, have
larger mean radii for the electron. Although it is meaningless to talk about a single
fixed “radius” of the atom in this picture, the probability of finding the electron
at a given $r$ is largest where $P_{nl}(r)$ is peaked. It is also common to use $\langle r \rangle$ as a
reasonable definition of the atomic radius for any given $n$, $l$ state.
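To see where the radial probability density peaks, one can evaluate $P_{nl}(r) = r^2 |R_{nl}(r)|^2$ on a grid. The sketch below assumes the standard normalized radial functions for $n = 1$ and $n = 2$ (with $r$ in units of $a_0$) and uses a simple grid search; the function names and grid parameters are illustrative choices.

```python
import math


def radial_density(n: int, l: int, r: float) -> float:
    """P_nl(r) = r^2 |R_nl(r)|^2, with r in units of a_0 (n <= 2 only)."""
    if (n, l) == (1, 0):
        R = 2.0 * math.exp(-r)
    elif (n, l) == (2, 0):
        R = (1.0 / math.sqrt(2.0)) * (1.0 - r / 2.0) * math.exp(-r / 2.0)
    elif (n, l) == (2, 1):
        R = (1.0 / (2.0 * math.sqrt(6.0))) * r * math.exp(-r / 2.0)
    else:
        raise ValueError("only n = 1 and n = 2 are tabulated in this sketch")
    return r * r * R * R


def peak_radius(n: int, l: int, r_max: float = 30.0, steps: int = 30000) -> float:
    """Grid-search estimate of the radius where P_nl(r) is largest."""
    dr = r_max / steps
    return max((i * dr for i in range(1, steps + 1)),
               key=lambda r: radial_density(n, l, r))


for state in [(1, 0), (2, 0), (2, 1)]:
    print(state, round(peak_radius(*state), 3))
```

The ground state peaks at $r = a_0$, while both $n = 2$ states peak at several times that radius, in line with the statement that larger $n$ corresponds to larger typical radii.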

Example 6.3. Calculate $\langle r \rangle$ for the Electron in the Ground State of Hydrogen.
The ground state wave function is

$$\psi_{100} = \frac{1}{\sqrt{\pi}}\left(\frac{1}{a_0}\right)^{3/2} e^{-r/a_0}$$

The desired expectation value is

$$\langle r \rangle = \int_{r=0}^{\infty}\int_{\theta=0}^{\pi}\int_{\phi=0}^{2\pi} r^2\,dr\,\sin\theta\,d\theta\,d\phi\,|\psi_{100}|^2\, r$$

The wave function is independent of $\theta$ and $\phi$, so

$$\int_{\theta=0}^{\pi}\int_{\phi=0}^{2\pi} \sin\theta\,d\theta\,d\phi = 4\pi$$

and

$$\langle r \rangle = \int_{r=0}^{\infty} 4\pi r^2\,dr\,\frac{1}{\pi}\left(\frac{1}{a_0}\right)^{3} e^{-2r/a_0}\, r
= \frac{4}{a_0^3}\int_{r=0}^{\infty} e^{-2r/a_0}\, r^3\,dr
= \frac{4}{a_0^3}\,\frac{6}{(2/a_0)^4}$$

Thus, $\langle r \rangle = 3a_0/2 = 8 \times 10^{-11}$ m. For this reason, the “radius of the hydrogen
atom” is often taken to be about $10^{-10}$ m.

FIGURE 6.7 The radial probability density, $P_{nl}(r)$, as a function of $r$ for the $n = 1$ and
$n = 2$ states of the hydrogen atom.
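The integral in this example can also be checked numerically. The sketch below uses a plain midpoint rule with $r$ in units of $a_0$; the Bohr radius value is a rounded reference number assumed here, and the cutoff and step count are arbitrary choices.

```python
import math

A0 = 5.29177e-11  # Bohr radius in metres (rounded value, assumed for this sketch)


def mean_radius_ground_state(r_max: float = 40.0, steps: int = 200_000) -> float:
    """Midpoint-rule estimate of <r>/a_0 = integral of 4 r^3 exp(-2r) dr."""
    dr = r_max / steps
    total = 0.0
    for i in range(steps):
        r = (i + 0.5) * dr
        total += 4.0 * r**3 * math.exp(-2.0 * r) * dr
    return total


mean_r = mean_radius_ground_state()
print(f"<r> = {mean_r:.4f} a_0 = {mean_r * A0:.2e} m")
```

The numerical value agrees with the closed-form result $3a_0/2$.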


This picture of the hydrogen atom has further explanatory power far beyond 
what is possible with the Bohr model. In particular, a more detailed analysis of the 
hydrogen spectrum reveals that the energy levels are not exactly given by Equation 
(6.50), but are slightly perturbed from the predicted values. It is possible to explain 
these perturbations within the context of the model we have just derived; this will 
be done in Chapter 9. 




EXERCISES 


6.1 A particle is confined inside a rectangular box given by $0 < x < a$, $0 < y < b$,
and $0 < z < c$. The solution to the Schrodinger equation is

$$\psi(x, y, z) = A \sin\left(\frac{n_x \pi x}{a}\right) \sin\left(\frac{n_y \pi y}{b}\right) \sin\left(\frac{n_z \pi z}{c}\right)$$

where $A$ is a constant. The energy levels are given by
$E = (\hbar^2\pi^2/2m)(n_x^2/a^2 + n_y^2/b^2 + n_z^2/c^2)$.

(a) Normalize the wave functions. You should obtain $A = \sqrt{8/V}$, where $V = abc$ is
the volume of the box.

(b) Suppose the particle is in the ground state. Calculate the probability of finding
the particle in the lower fourth of the box, i.e., in the region $z < c/4$.

6.2 A particle is confined inside a cubic box with edge of length $a$. Show that there are
six different wave functions that have $E = 14(\hbar^2\pi^2/2ma^2)$. (This is called sixfold
degeneracy.)

6.3 (a) A particle is confined inside a rectangular box with sides of length $a$, $a$, and
$2a$. What is the energy of the first excited state? Is this state degenerate? If so,
determine how many different wave functions have this energy.

(b) Now assume the rectangular box has sides of length a, 2a, and 2a. What is the 
energy of the first excited state? Is this state degenerate? If so, determine how 
many different wave functions have this energy. 

6.4 (a) A particle with mass $m$ and energy $E$ is inside a square tube with infinite potential
barriers at $x = 0$, $x = a$, $y = 0$, and $y = a$. The tube is infinitely long in the $z$
direction. Inside the tube, $V = 0$. The particle is moving in the $+z$ direction. Solve
the Schrodinger equation to derive the allowed wave functions for this particle.
Do not try to normalize the wave functions, but make sure they correspond to
motion in the $+z$ direction.

(b) Energy should not be quantized in this case because the particle is not in a bound 
state. Use the answer from part (a) to show that this is indeed the case. 


6.5 Show that $L_x$ and $L_y$ are Hermitian.

6.6 Verify that $[X, Y] = 0$ and $[P_x, P_y] = 0$.

6.7 Calculate these commutators:

$[L_z, P_x]$, $[L_z, X]$, $[L_z, P_y]$, $[L_z, Y]$, $[L_z, P_z]$, $[L_z, Z]$

6.8 Show that $\hbar$ has units of angular momentum.

6.9 The operator $Q$ obeys the commutation relation $[Q, H] = E_0 Q$, where $E_0$ is a constant
with units of energy. Show that if $\psi(x)$ is a solution of the time-independent
Schrodinger equation with energy $E$, then $Q\psi(x)$ is also a solution of the
time-independent Schrodinger equation, and determine the energy corresponding to
$Q\psi(x)$.





6.10 The simple harmonic oscillator in one dimension can also be solved by the method of
ladder operators. This solution is simpler and more elegant than the one in Chapter 4.

(a) For a particle of mass $m$ in a one-dimensional simple harmonic oscillator potential
$V(x) = \frac{1}{2}Kx^2$, the Hamiltonian operator is

$$H = -\frac{\hbar^2}{2m}\frac{d^2}{dx^2} + \frac{1}{2}Kx^2$$

Define the ladder operators $a_+$ and $a_-$ to be given by

$$a_+ = -\frac{\hbar}{\sqrt{2m}}\frac{d}{dx} + \sqrt{\frac{K}{2}}\,x$$

and

$$a_- = \frac{\hbar}{\sqrt{2m}}\frac{d}{dx} + \sqrt{\frac{K}{2}}\,x$$

Show that

$$[H, a_+] = \hbar\omega a_+$$

and

$$[H, a_-] = -\hbar\omega a_-$$

where $\omega = \sqrt{K/m}$.

(b) Suppose that $\psi(x)$ is a solution of the time-independent Schrodinger equation
for the harmonic oscillator with energy $E$. Show that $a_+\psi(x)$ is also a solution
of the time-independent Schrodinger equation for the harmonic oscillator with
energy $E + \hbar\omega$. Show that $a_-\psi(x)$ is a solution with energy $E - \hbar\omega$.

(c) Show that $a_+ a_- = H - \hbar\omega/2$.

(d) There is no upper bound on the possible values for $E$, but there is a lower bound;
the energy cannot be negative. This means that if $\psi_0(x)$ is the ground state wave
function, then $a_-\psi_0(x) = 0$. Using the relation derived in part (c), show that the
ground state wave function has energy $\hbar\omega/2$.

(e) Write out the equation $a_-\psi_0(x) = 0$ as a differential equation and solve it to find
the ground-state wave function.

6.11 A particle is confined in a cubic box with edge of length $a$, with $V = 0$ inside the
box. The particle is in its ground state; determine whether or not the particle is in an
eigenstate of $L_z$.

6.12 Consider a three-dimensional system with wave function $\psi$. If $\psi$ is in the $l = 0$ state,
we already know that $L_z\psi = 0$. Show that $L_x\psi = 0$ and $L_y\psi = 0$ as well. (Note this
is the only exception to the rule that a wave function cannot be simultaneously an
eigenfunction of $L_x$, $L_y$, and $L_z$.)

6.13 A particle is in an eigenstate of $L^2$ and $L_z$, with quantum numbers $l$ and $m_l$. By
symmetry, we must have $\langle L_x^2 \rangle = \langle L_y^2 \rangle$. Show that $\hbar^2 l/2 \le \langle L_x^2 \rangle \le \hbar^2 l(l+1)/2$.

6.14 The “radius of the hydrogen atom” is often taken to be on the order of about
$10^{-10}$ m. If a measurement is made to determine the location of the electron for
hydrogen in its ground state, what is the probability of finding the electron within
$10^{-10}$ m of the nucleus?

6.15 (a) The electron in a hydrogen atom is in the $l = 1$ state having the lowest possible
energy and the highest possible value for $m_l$. What are the $n$, $l$, and $m_l$ quantum
numbers?

(b) A particle is moving in an unknown central potential. The wave function of the
particle is spherically symmetric. What are the values of $l$ and $m_l$?

6.16 The deuteron is a nucleus of “heavy hydrogen” consisting of one proton and one
neutron. As a simple model for this nucleus, consider a single particle of mass $m$
moving in a fixed spherically-symmetric potential $V(r)$, defined by $V(r) = -V_0$
for $r < r_0$ and $V(r) = 0$ for $r > r_0$. This is called a spherical square-well potential.
Assume that the particle is in a bound state with $l = 0$.

(a) Find the general solutions $R(r)$ to the radial Schrodinger equation for $r < r_0$ and
$r > r_0$. Use the fact that the wave function must be finite at $0$ and $\infty$ to simplify
the solution as much as possible. (You do not have to normalize the solutions.)

(b) The deuteron is only just bound; i.e., $E$ is nearly equal to 0. Take $m$ to be the
proton mass, $m = 1.67 \times 10^{-27}$ kg, and take $r_0$ to be a typical nuclear radius,
$r_0 = 1 \times 10^{-15}$ m. Find the value of $V_0$ (the depth of the potential well) in MeV
(1 MeV $= 1.6 \times 10^{-13}$ J). (Hint: The continuity conditions at $r_0$ must be used.
The radial wave function $R(r)$ and its derivative $R'(r)$ must both be continuous
at $r_0$; this is equivalent to requiring that $u(r)$ and $u'(r)$ must both be continuous
at $r_0$, where $u(r) = rR(r)$. The resulting equations cannot be solved exactly but
can be used to derive the value for $V_0$.)

6.17 Determine all potentials $V(r, \theta, \phi)$ for which it is possible to find solutions of the
time-independent Schrodinger equation which are also eigenfunctions of the operator
$L_z$.

6.18 A particle with mass $m$ is confined inside of a spherical cavity of radius $r_0$. The potential
is spherically symmetric and can be written in the form: $V(r) = 0$ for $r < r_0$, and
$V(r) = \infty$ for $r \ge r_0$; in other words, there is an infinite potential barrier at $r = r_0$.
The particle is in the $l = 0$ state.

(a) Solve the radial Schrodinger equation and use the appropriate boundary conditions
to find the ground state radial wave function $R(r)$ and the ground state
energy. You do not have to normalize the solution.

(b) What is the pressure exerted by the particle (in the $l = 0$ ground state) on the
surface of the sphere?

6.19 A particle of mass $m$ is in a three-dimensional, spherically-symmetric harmonic
oscillator potential given by $V(r) = (1/2)Kr^2$. The particle is in the $l = 0$ state. Find
the ground-state radial wave function $R(r)$ and the ground-state energy. You do not
have to normalize the solution.

6.20 Deuterium is an isotope of hydrogen with a nucleus consisting of one proton and one
neutron. Let $\lambda(D)_{2 \to 1}$ be the wavelength of the photon emitted when the electron in a
deuterium atom drops from the $n = 2$ state to the $n = 1$ state, and let $\lambda(H)_{2 \to 1}$ be the
corresponding wavelength for ordinary hydrogen. Calculate $\lambda(D)_{2 \to 1}$ relative to $\lambda(H)_{2 \to 1}$.



CHAPTER

7


Math Interlude C: Matrices,
Dirac Notation, and the Dirac
Delta Function


In Chapter 5, we examined some of the general properties of vector spaces. We 
have, so far, treated functions as general vectors in an abstract vector space, with 
the inner product represented as an integral over pairs of the functions, and linear 
operators transforming one function into another. Here we examine two other 
ways of treating vector spaces. In the next section, we examine finite-dimensional 
vectors, for which linear operators are represented by matrices. In the following 
section, we introduce Dirac notation, which is simply a general means to represent 
arbitrary vectors in an abstract vector space. Finally, we discuss briefly an unrelated 
topic, but one which comes up frequently in quantum mechanics as well as in other 
areas of physics: the Dirac delta function. 


7.1 ■ THE MATRIX FORMULATION OF LINEAR OPERATORS


As we saw in Chapter 5, the set of wave functions $\psi(x)$ can be treated as an
infinite-dimensional vector space with the appropriate definitions for linear operators and
inner products. However, we will often be interested in finite-dimensional vector
spaces. These vector spaces occur, for instance, in the study of angular momentum.
For example, suppose that a particle has total angular momentum $j = 1/2$, so that
the possible $m_j$ states are $-1/2$ and $+1/2$. We can represent these two states as
column vectors; for instance, $\begin{pmatrix} 1 \\ 0 \end{pmatrix}$ can represent the $m_j = +1/2$ state, and $\begin{pmatrix} 0 \\ 1 \end{pmatrix}$ can
represent the $m_j = -1/2$ state. In general, we will use column vectors of the form

$$\begin{pmatrix} x_1 \\ x_2 \\ \vdots \\ x_n \end{pmatrix}$$

to represent general finite-dimensional vectors; the dimension of the vector
space is then just the number of entries in the column.

How is a linear operator represented using this formulation? If we are dealing
with an $n$-dimensional vector space, so that the vectors are column vectors with $n$
entries, then a general linear operator is an $n \times n$ matrix, and the act of operating
on the vector is simply given by multiplying the matrix by the column vector.

Recall how matrix multiplication works. Suppose that $A$ is an $l \times m$ matrix,
and $B$ is an $m \times n$ matrix. Then the elements of the product, $C = AB$, are given
by

$$C_{jk} = \sum_{i=1}^{m} A_{ji} B_{ik} \qquad (7.1)$$





FIGURE 7.1 If C = AB, then to obtain a given element in matrix C, scan across the 
corresponding row of matrix A and the corresponding column of matrix B, multiply each 
pair of numbers and add the result. 

where $A_{ji}$, for example, is the element in the $j$th row and the $i$th column of $A$.
Thus, multiplication of an $l \times m$ matrix by an $m \times n$ matrix yields an $l \times n$ matrix,
and such multiplication is possible only if the number of columns in $A$ is equal to
the number of rows in $B$.

More graphically, we can think of a single element in matrix $C$ being generated
by scanning across the corresponding row of matrix $A$ and the corresponding
column of matrix $B$, multiplying each pair of numbers and adding the result
(Figure 7.1).
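The row-times-column rule of Equation (7.1) translates directly into code. The following is a minimal sketch using nested lists; the function name and the two sample matrices are illustrative choices made here, not matrices taken from the text.

```python
def matmul(A, B):
    """Return C = AB with C[j][k] = sum_i A[j][i] * B[i][k], as in Eq. (7.1)."""
    inner = len(B)
    if any(len(row) != inner for row in A):
        raise ValueError("columns of A must equal rows of B")
    cols = len(B[0])
    return [[sum(A[j][i] * B[i][k] for i in range(inner)) for k in range(cols)]
            for j in range(len(A))]


# Two illustrative 2 x 2 matrices (assumed for this sketch).
A = [[0, 1], [1, 0]]
B = [[1, 0], [0, -1]]

AB = matmul(A, B)
BA = matmul(B, A)
print(AB)  # [[0, -1], [1, 0]]
print(BA)  # [[0, 1], [-1, 0]]
```

Since `AB` and `BA` differ, this pair also illustrates that matrix multiplication depends on the order of the factors.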


Example 7.1. Matrix Multiplication Is not Commutative. 

Consider the two matrices A = ^ ^ j and B = ij jj j. Using the rules for matrix 

multiplication gives 


while 

^ = (7 

A column vector with $n$ entries can be treated as an $n \times 1$ matrix. Hence,
multiplying an $n \times n$ matrix by an $n \times 1$ column vector yields another $n \times 1$ column
vector. Thus, for a vector space consisting of $n$-dimensional column vectors, the
operators are $n \times n$ matrices:

$$\begin{pmatrix} A_{11} & A_{12} & A_{13} \\ A_{21} & A_{22} & A_{23} \\ A_{31} & A_{32} & A_{33} \end{pmatrix}
\begin{pmatrix} x_1 \\ x_2 \\ x_3 \end{pmatrix} = \begin{pmatrix} y_1 \\ y_2 \\ y_3 \end{pmatrix}$$

The multiplication of two operators represented by the matrices $A$ and $B$ corresponds
to multiplication of the two matrices. Since matrix multiplication is not
commutative (Example 7.1), it will not be true, in general, that any two operators
will commute, as we have already noted for many quantum mechanical operators.

We can also find the eigenvalues and eigenvectors of an operator represented as
a matrix. Suppose we have a matrix $A$ multiplying a column vector $x$, and assume
that one of the eigenvalues is $c$. Then the eigenvalue equation is

$$Ax = cx \qquad (7.2)$$

Because we are dealing with matrices, this can be written as

$$Ax = cIx \qquad (7.3)$$

where $I$ is the identity matrix with 1's along the main diagonal and 0's everywhere
else. The identity matrix has the property that $Ix = x$ for any column vector $x$. For
example, in three dimensions,

$$\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}
\begin{pmatrix} x_1 \\ x_2 \\ x_3 \end{pmatrix} = \begin{pmatrix} x_1 \\ x_2 \\ x_3 \end{pmatrix}$$

Then Equation (7.3) can be rewritten as

$$(A - cI)x = 0 \qquad (7.4)$$


In three dimensions, for example, Equation (7.4) corresponds to

$$\begin{pmatrix} A_{11} - c & A_{12} & A_{13} \\ A_{21} & A_{22} - c & A_{23} \\ A_{31} & A_{32} & A_{33} - c \end{pmatrix}
\begin{pmatrix} x_1 \\ x_2 \\ x_3 \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \\ 0 \end{pmatrix}$$

A matrix equation of this form always has a trivial solution of the form

$$x = \begin{pmatrix} 0 \\ 0 \\ \vdots \\ 0 \end{pmatrix}$$

i.e., a vector for which all of the entries are zero. This is not a very interesting
solution. In order for Equation (7.4) to have a nonzero solution, the determinant
of $A - cI$ must be zero. For example, in three dimensions, the condition for a
nonzero solution is

$$\begin{vmatrix} A_{11} - c & A_{12} & A_{13} \\ A_{21} & A_{22} - c & A_{23} \\ A_{31} & A_{32} & A_{33} - c \end{vmatrix} = 0$$

An $n \times n$ determinant of this kind, in which each term on the main diagonal
is of the form $A_{ii} - c$, corresponds to a polynomial of degree $n$ in the variable
$c$. Thus, there will always be $n$ complex values of $c$ for which the determinant is
zero, corresponding to $n$ complex eigenvalues (not necessarily all distinct). The
first step is to find these eigenvalues; they can then each be inserted, in turn, into
Equation (7.2) to find the corresponding eigenvectors. Note from Equation (7.2)
that an eigenvector multiplied by an arbitrary constant will remain an eigenvector
with the same eigenvalue, a result that is already familiar from our earlier work
with eigenvectors.






Example 7.2. The Eigenvalues and Eigenvectors of the Matrix $A = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$.

Assume an eigenvalue $c$. Then the determinant equation is

$$\begin{vmatrix} -c & 1 \\ 1 & -c \end{vmatrix} = 0$$

Expanding out the determinant gives

$$c^2 - 1 = 0$$

so

$$c = \pm 1$$

Thus, the eigenvalues are $c = -1$ and $c = 1$. These can now be inserted into the
eigenvalue equation. First, for $c = 1$,

$$\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}\begin{pmatrix} x_1 \\ x_2 \end{pmatrix} = \begin{pmatrix} x_1 \\ x_2 \end{pmatrix}$$

which yields the two equations:

$$x_2 = x_1$$

$$x_1 = x_2$$

Note that these two equations are not independent. This will always be the case,
since there must always be one extra degree of freedom, which allows multiplication
of the final solution by an arbitrary constant. If a set of completely independent
equations has been obtained (so that every component of the vector $x$ can be
calculated exactly), then the eigenvalues have been calculated incorrectly. Now we
fix $x_1$ to any value, and $x_2$ will be determined. So, for example, we can choose
$x_1 = 1$, which gives $x_2 = 1$, yielding the eigenvector $\begin{pmatrix} 1 \\ 1 \end{pmatrix}$, with eigenvalue $c = 1$.
Now consider $c = -1$. For this case, we get

$$\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}\begin{pmatrix} x_1 \\ x_2 \end{pmatrix} = -\begin{pmatrix} x_1 \\ x_2 \end{pmatrix}$$

so

$$x_2 = -x_1$$

$$x_1 = -x_2$$

Again, these are not independent equations. We can take, for example, $x_1 = 1$,
which gives $x_2 = -1$, yielding the eigenvector $\begin{pmatrix} 1 \\ -1 \end{pmatrix}$. However, we could just as
easily have taken $x_1 = -1$, giving $x_2 = +1$ and producing the eigenvector $\begin{pmatrix} -1 \\ 1 \end{pmatrix}$.
It would appear that we have found two different eigenvectors. However, this is
not the case: $\begin{pmatrix} 1 \\ -1 \end{pmatrix}$ and $\begin{pmatrix} -1 \\ 1 \end{pmatrix}$ are two different representations of the same vector;
they differ by an overall multiplicative factor of $-1$.
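For a $2 \times 2$ matrix the determinant condition is a quadratic in $c$, so the procedure of this example can be sketched in a few lines. The code below assumes the matrix with zeros on the diagonal and ones off the diagonal, consistent with the equations $c^2 - 1 = 0$, $x_2 = x_1$ and $x_2 = -x_1$ above; the helper names are illustrative.

```python
import cmath


def eigenvalues_2x2(A):
    """Roots of det(A - cI) = c^2 - (trace) c + (det) = 0 for a 2 x 2 matrix."""
    (a, b), (c, d) = A
    trace = a + d
    det = a * d - b * c
    disc = cmath.sqrt(trace * trace - 4 * det)
    return ((trace - disc) / 2, (trace + disc) / 2)


def is_eigenvector(A, x, c, tol=1e-12):
    """Check that A x = c x component by component."""
    Ax = [sum(A[j][i] * x[i] for i in range(2)) for j in range(2)]
    return all(abs(Ax[j] - c * x[j]) < tol for j in range(2))


A = [[0, 1], [1, 0]]
values = eigenvalues_2x2(A)
print(values)
print(is_eigenvector(A, [1, 1], 1), is_eigenvector(A, [1, -1], -1))
print(is_eigenvector(A, [-1, 1], -1))  # same eigenvector up to a factor of -1
```

The last line confirms the closing remark of the example: rescaling an eigenvector by a constant leaves it an eigenvector with the same eigenvalue.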



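The inner product of two $n$-dimensional column vectors $x$ and $y$ is given by

$$\langle x|y \rangle = \begin{pmatrix} x_1^* & x_2^* & \cdots & x_n^* \end{pmatrix}
\begin{pmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{pmatrix} \qquad (7.5)$$

This is identical to the familiar dot product of two vectors, except that all of the
elements of the first vector are complex-conjugated (which will matter only when
they are not real numbers). This definition of the inner product can be used to
normalize any $n$-dimensional column vector. In order that $x$ be normalized, we
require $\langle x|x \rangle = 1$, which corresponds to

$$\begin{pmatrix} x_1^* & x_2^* & \cdots & x_n^* \end{pmatrix}
\begin{pmatrix} x_1 \\ x_2 \\ \vdots \\ x_n \end{pmatrix} = 1 \qquad (7.6)$$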



Example 7,3, Normalizing the Vector ^ a 


We multiply this vector by the constant $c$, and then require that Equation (7.6) be
satisfied:



Multiplying out the matrices gives

$$|c|^2 (1 + 9 + 4) = 1$$

for which we choose the positive real solution, $c = 1/\sqrt{14}$. As in our previous
derivation of normalization constants, there is an arbitrary phase factor, i.e., $c$
can be multiplied by $e^{i\phi}$, where $\phi$ is any real number, and the vector will still be
normalized. However, it is usually easiest to take $c$ to be real and positive, which
we will do here. Hence, the normalized vector is the original vector multiplied by $1/\sqrt{14}$.
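The same normalization can be carried out numerically. In the sketch below the sample vector is a hypothetical one, chosen only so that the squared magnitudes of its entries are 1, 9, and 4 as in the sum above; it is not claimed to be the vector of the example. The function names are illustrative.

```python
import math


def inner(x, y):
    """Inner product <x|y>: conjugate the entries of the first vector."""
    return sum(complex(xi).conjugate() * yi for xi, yi in zip(x, y))


def normalize(x):
    """Scale x by the real, positive constant c = 1/sqrt(<x|x>)."""
    norm_sq = inner(x, x).real
    if norm_sq == 0:
        raise ValueError("the zero vector cannot be normalized")
    c = 1.0 / math.sqrt(norm_sq)
    return c, [c * xi for xi in x]


# Hypothetical vector with |x_1|^2 = 1, |x_2|^2 = 9, |x_3|^2 = 4.
v = [1, 3j, 2]
c, u = normalize(v)
print(c, abs(inner(u, u)))
```

Multiplying `c` by any phase factor $e^{i\phi}$ would leave $\langle u|u \rangle$ unchanged, which is the arbitrariness noted in the text.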








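If we insert an operator into the inner product, we end up with the product of
three matrices:

$$\langle x|Ay \rangle = \begin{pmatrix} x_1^* & x_2^* & \cdots & x_n^* \end{pmatrix}
\begin{pmatrix} A_{11} & A_{12} & \cdots & A_{1n} \\ A_{21} & A_{22} & \cdots & A_{2n} \\ \vdots & \vdots & & \vdots \\ A_{n1} & A_{n2} & \cdots & A_{nn} \end{pmatrix}
\begin{pmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{pmatrix}$$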




Using these definitions, it is possible to determine the adjoint of a matrix operator.
Recall that the adjoint $A^\dagger$ of the operator $A$ has the property that

$$\langle x|Ay \rangle = \langle A^\dagger x|y \rangle \qquad (7.7)$$

for all $x$ and $y$. Using Equation (7.5) to write Equation (7.7) in matrix form, and
expanding the matrix products out in terms of their components as in Equation
(7.1), gives

$$\sum_{i,j=1}^{n} x_i^* A_{ij} y_j = \sum_{i,j=1}^{n} (A^\dagger_{ij} x_j)^* y_i$$

Note that matrix elements are just numbers, so they do commute, and we can
rewrite the right-hand side of the equation:

$$\sum_{i,j=1}^{n} x_i^* A_{ij} y_j = \sum_{i,j=1}^{n} x_j^* A^{\dagger *}_{ij} y_i
= \sum_{i,j=1}^{n} x_i^* A^{\dagger *}_{ji} y_j \qquad (7.8)$$

where, in the last line, we have simply interchanged the summation labels $i$ and $j$.
Equation (7.8) implies that

$$A^\dagger_{ij} = A^*_{ji}$$

so that the adjoint of a matrix operator corresponds to the conjugate transpose of
the matrix, i.e., the interchange of rows and columns and the complex-conjugating
of all of the entries.


Example 7.4. Determining the Adjoint of a Matrix. 

Consider the matrix operator 

1 


1 '■ ^ 
-3 e '*) 


where $\theta$ is a real number. What is the matrix corresponding to the adjoint
operator $L^\dagger$?





First transpose the matrix to get | 1 ^ Then take the complex-conjugate of 
all of the entries to obtain the adjoint operator; 



The matrix corresponding to a Hermitian operator must be self-adjoint. Thus, 
such a matrix must be equal to its conjugate transpose. For a real matrix, this means 

that the matrix is symmetric, e.g., (\ But Hermitian complex matrices need 

not be symmetric. For example, the matrix ( 0 j corresponds to a Hermitian 
operator. 
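The conjugate-transpose rule and the Hermitian test can be checked with a short sketch. The matrices and vectors below are illustrative values assumed here; they are not the ones printed in the examples above, and the helper names are hypothetical.

```python
def adjoint(A):
    """Conjugate transpose: (A_dagger)[i][j] = conjugate of A[j][i]."""
    return [[complex(A[j][i]).conjugate() for j in range(len(A))]
            for i in range(len(A[0]))]


def apply(A, x):
    """Matrix times column vector."""
    return [sum(row[i] * x[i] for i in range(len(x))) for row in A]


def inner(x, y):
    """Inner product <x|y> with the first vector conjugated."""
    return sum(complex(xi).conjugate() * yi for xi, yi in zip(x, y))


def is_hermitian(A, tol=1e-12):
    """True if the square matrix A equals its conjugate transpose."""
    B = adjoint(A)
    n = len(A)
    return all(abs(A[i][j] - B[i][j]) < tol for i in range(n) for j in range(n))


# Illustrative values, assumed for this sketch.
M = [[1, 2j], [3, 1 - 1j]]   # not Hermitian
H = [[2, 1j], [-1j, 3]]      # complex, not symmetric, but Hermitian
x = [1 + 1j, 2]
y = [3, -1j]

lhs = inner(x, apply(M, y))           # <x|My>
rhs = inner(apply(adjoint(M), x), y)  # <M_dagger x|y>
print(lhs, rhs, is_hermitian(M), is_hermitian(H))
```

Both sides of Equation (7.7) agree for the sample vectors, and `H` passes the Hermitian test even though it is not a symmetric matrix.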


7.2 ■ DIRAC NOTATION 

In Chapter 5 we introduced the properties of an abstract vector space, as well as 
operators and inner products acting on that space. We then made use of a special 
case in which the elements of the vector space were functions, and the inner product 
was simply an integral: 


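$$\langle f|g \rangle = \int f(x)^* g(x)\,dx$$

However, as noted in the previous section, another perfectly acceptable example of
a vector space is the set of column vectors for which the inner product is, instead,

$$\langle x|y \rangle = \begin{pmatrix} x_1^* & x_2^* & \cdots & x_n^* \end{pmatrix}
\begin{pmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{pmatrix}$$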


Often, however, we will want to deal with general vector spaces and general inner 
products, without reference to whether we are talking about functions, column 
vectors, or some other special class of vectors. In fact, we have already developed 
the abstract notation to deal with general vector spaces, operators, and inner products
in Chapter 5. This notation is commonly in use in the world of mathematics.
In quantum mechanics, however, most physicists use a slightly different notation
called Dirac notation, which we now discuss.

In Dirac notation, a general vector is written like this: $|\psi\rangle$. These vectors satisfy
all of the properties of a vector space, as defined in Chapter 5, e.g., the sum of two
vectors is a vector:

$$|\psi_1\rangle + |\psi_2\rangle = |\psi_3\rangle$$

and the product of a vector and a complex number is a vector:

$$c|\psi_1\rangle = |\psi_2\rangle$$


To represent the inner product of two vectors, $|\phi\rangle$ and $|\psi\rangle$, we simply write $\langle\phi|\psi\rangle$.
Now suppose that the vectors $|\phi_1\rangle, |\phi_2\rangle, \ldots, |\phi_n\rangle$ represent an orthonormal basis
set for our vector space, i.e.,

$$\langle\phi_i|\phi_j\rangle = \delta_{ij}$$

We saw in Chapter 5 that an arbitrary three-dimensional vector $\mathbf{r}$ could be expressed
as a sum of orthonormal basis set vectors as

$$\mathbf{r} = (\mathbf{r} \cdot \hat{x})\hat{x} + (\mathbf{r} \cdot \hat{y})\hat{y} + (\mathbf{r} \cdot \hat{z})\hat{z}$$

The generalization of this to arbitrary vectors in Dirac notation is

$$|\psi\rangle = |\phi_1\rangle\langle\phi_1|\psi\rangle + |\phi_2\rangle\langle\phi_2|\psi\rangle + \cdots + |\phi_n\rangle\langle\phi_n|\psi\rangle \qquad (7.9)$$

Note that in this equation, the constants multiplying the basis vectors are the inner
products $\langle\phi_1|\psi\rangle, \langle\phi_2|\psi\rangle, \ldots, \langle\phi_n|\psi\rangle$; we have written these constants to the right
of the basis vectors themselves, $|\phi_1\rangle, |\phi_2\rangle, \ldots, |\phi_n\rangle$. This is completely equivalent
to the notation developed in Chapter 5, where we used $\langle\phi|\psi\rangle$ to represent an inner
product. We have not introduced any new mathematics so far; instead, we have
rewritten the ideas developed in Chapter 5 in this new notation.

Now we introduce a new property of vector spaces. Suppose we have an inner
product of two Dirac vectors, given by

$$\langle\phi|\psi\rangle$$

The right-hand part of this inner product is just the vector $|\psi\rangle$. But suppose we
detach the left-hand side, and write it as the quantity $\langle\phi|$. Does this have any
meaning at all? We can combine $\langle\phi|$ with any vector $|\psi\rangle$ to get the complex
number $\langle\phi|\psi\rangle$. Thus, the quantity $\langle\phi|$ is a function which maps the vector space
into the set of complex numbers. It is clumsier to express this concept in terms
of the inner product notation developed in Chapter 5; in that notation we would
need to write $\langle\phi|$ as $\langle\phi|\;\;\rangle$, where any vector could be inserted in the blank space
to produce a complex number. It turns out that this set of all mappings of the
vector space into the set of complex numbers has all of the properties of a vector
space itself, so it is given a special name: the dual space. Given a set of vectors
$|\phi_1\rangle, |\phi_2\rangle, \ldots, |\phi_n\rangle$, the dual space consists of $\langle\phi_1|, \langle\phi_2|, \ldots, \langle\phi_n|$, with all of
the properties of a vector space, namely, the sum of two elements of the dual space
is a member of the dual space:

$$\langle\phi_1| + \langle\phi_2| = \langle\phi_3|$$

as is the product of a complex number with a member of the dual space:

$$c\langle\phi_1| = \langle\phi_2|$$

The inner product in Dirac notation $\langle\phi|\psi\rangle$ is sometimes called a bracket, so that
$\langle\phi_1|, \langle\phi_2|, \ldots, \langle\phi_n|$ are called bra vectors, and $|\phi_1\rangle, |\phi_2\rangle, \ldots, |\phi_n\rangle$ are called ket
vectors.

With this new idea of dual vectors, Equation (7.9) can be written in a particularly
elegant fashion:
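One standard way to express this idea is the completeness relation, in which the sum of the products $|\phi_i\rangle\langle\phi_i|$ over an orthonormal basis acts as the identity; it is quoted here as the usual textbook statement. The sketch below checks it for a finite-dimensional case, using an illustrative two-dimensional basis and vector that are assumptions of this example.

```python
import math


def inner(x, y):
    """Inner product <x|y> with the first vector conjugated."""
    return sum(complex(a).conjugate() * b for a, b in zip(x, y))


def expand(psi, basis):
    """Coefficients <phi_i|psi> and the vector rebuilt as in Equation (7.9)."""
    coeffs = [inner(phi, psi) for phi in basis]
    rebuilt = [sum(c * phi[k] for c, phi in zip(coeffs, basis))
               for k in range(len(psi))]
    return coeffs, rebuilt


def projector_sum(basis):
    """Matrix of sum_i |phi_i><phi_i|, with entries phi_i[j] * conj(phi_i[k])."""
    dim = len(basis[0])
    return [[sum(phi[j] * complex(phi[k]).conjugate() for phi in basis)
             for k in range(dim)] for j in range(dim)]


S = 1 / math.sqrt(2)
BASIS = [[S, S], [S, -S]]   # illustrative orthonormal basis
psi = [2 + 1j, -3j]         # illustrative vector

coeffs, rebuilt = expand(psi, BASIS)
P = projector_sum(BASIS)
print(coeffs)
print(rebuilt)
print(P)
```

Up to rounding, `P` is the identity matrix and `rebuilt` reproduces `psi`, which is the content of Equation (7.9) for this basis.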




In Dirac notation, linear operators behave in the standard way, e.g., if $P$ is a
linear operator, we have

$$P(c|\psi\rangle) = cP|\psi\rangle$$

and

$$P(|\psi_1\rangle + |\psi_2\rangle) = P|\psi_1\rangle + P|\psi_2\rangle$$

Inner products with an operator inserted are written as $\langle\phi|P|\psi\rangle$; this is equivalent
to $\langle\phi|P\psi\rangle$ in the notation of Chapter 5. Finally, the dual of the vector $P|\psi\rangle$ is
simply $\langle\psi|P^\dagger$.

It is important to remember that vectors and operators in Dirac notation are
completely general; they will sometimes be used to represent finite-dimensional
vector spaces, for which the operators can be represented as matrices and the
vectors as column vectors, but they can also represent infinite-dimensional vector
spaces. For example, the time-independent Schrödinger equation is simply

$$H|\psi\rangle = E|\psi\rangle$$

Sometimes this will represent the familiar Schrödinger equation with derivatives
of wave functions, but sometimes it will represent matrix quantities, as in the next
chapter.


7.3 ■ THE DIRAC DELTA FUNCTION

In this section we will introduce a function that comes up frequently in physics
and has some rather remarkable properties: the Dirac delta function. The delta
function, written $\delta(x)$, has the following properties:

$$\delta(x) = 0 \quad \text{for } x \neq 0 \tag{7.10}$$

$$\int_{-\infty}^{\infty} \delta(x)\,dx = 1 \tag{7.11}$$

$$\int_{-\infty}^{\infty} \delta(x) f(x)\,dx = f(0) \tag{7.12}$$

Although the range of these integrals has been taken from $-\infty$ to $\infty$, these
equations are valid as long as the range of integration includes the point x = 0.

FIGURE 7.2 The delta function is zero everywhere except at x = 0, where it is infinitely
peaked.

FIGURE 7.3 The delta function can be treated as the limit of a set of functions with unit
area but decreasing range.

A function which can satisfy these equations must be very strange indeed. It
is zero everywhere except at x = 0, but in order for it to integrate to 1 (Equation
7.11), it must be infinitely sharply peaked at the origin (Figure 7.2). In fact, from
a mathematician’s point of view, the delta function is not really a function at all,
but it can be defined in a rigorous way as the limit of a set of functions of constant
(unit) area, but with an increasingly narrow range (Figure 7.3). The property for
which the delta function is primarily used is given by Equation (7.12): the integral
of the product of a delta function with any other function $f(x)$ “picks out” the
value of $f(x)$ at the origin. If we wish to pick out the value of $f(x)$ at some other
value of x, a change of variables gives

$$\int_{-\infty}^{\infty} \delta(x - a) f(x)\,dx = f(a) \tag{7.13}$$
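
The limiting picture of the delta function can be explored numerically. The sketch
below is an illustration only: it assumes a unit-area Gaussian of width `width` as one
possible member of the family of increasingly narrow functions (the text does not
specify which family is used), and it approximates the integral in Equation (7.13) by a
simple midpoint sum. The function names and the choice $f(x) = \cos x$, $a = 1$ are ours.

```python
import math

def narrow_gaussian(x, width):
    """Unit-area Gaussian; one possible stand-in for delta(x) as width -> 0."""
    return math.exp(-x * x / (2 * width * width)) / (width * math.sqrt(2 * math.pi))

def sift(f, a, width, half_range=10.0, n=20001):
    """Midpoint-rule estimate of the integral of delta(x - a) f(x) dx,
    with delta replaced by a Gaussian of the given width centered at a."""
    lo = a - half_range * width
    hi = a + half_range * width
    dx = (hi - lo) / n
    total = 0.0
    for i in range(n):
        x = lo + (i + 0.5) * dx
        total += narrow_gaussian(x - a, width) * f(x) * dx
    return total

# As the width shrinks, the integral approaches f(a) = cos(1).
for width in (0.5, 0.1, 0.01):
    print(width, sift(math.cos, 1.0, width), math.cos(1.0))
```

For each width the area under the Gaussian stays equal to 1, while the value of the
integral moves toward $f(a)$ as the peak narrows.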


The Kronecker delta, which we have already encountered, is a discrete version
of the delta function; the expression corresponding to Equation (7.13) for the
Kronecker delta is

$$\sum_m \delta_{mn}\, c_m = c_n$$


We can also define a three-dimensional delta function, $\delta^3(\mathbf{r})$, given by

$$\delta^3(\mathbf{r}) = \delta(x)\,\delta(y)\,\delta(z)$$

Then the equation corresponding to Equation (7.13) in three dimensions is

$$\int \delta^3(\mathbf{r} - \mathbf{r}_0)\, f(\mathbf{r})\, d^3r = f(\mathbf{r}_0)$$


where the range of integration must include the point $\mathbf{r}_0$. Thus, the integral over
the three-dimensional delta function picks out the value of the function at a single
point in three-dimensional space.

The delta function is useful in representing the kinds of idealized point
distributions that are frequently encountered in physics. For example, consider the charge
density, $\rho(\mathbf{r})$, produced by a point charge $e$ at the position $\mathbf{r}_0$. This charge density
can be written in terms of a delta function:

$$\rho(\mathbf{r}) = e\,\delta^3(\mathbf{r} - \mathbf{r}_0)$$

This charge density is infinite at the point $\mathbf{r}_0$ and zero everywhere else, but the total
charge $Q$ is well-defined:

$$Q = \int \rho(\mathbf{r})\, d^3r = \int e\,\delta^3(\mathbf{r} - \mathbf{r}_0)\, d^3r = e$$







EXERCISES 

7.1 The operator Q is given by the matrix: 

e “(-< -i) 

(a) Determine the matrix corresponding to $Q^\dagger$.

(b) Is Q Hermitian?

(c) Find the eigenvalues of Q.

(d) For each eigenvalue in part (c), determine the corresponding eigenvector.

7.2 The operator A is given by the matrix:

$$A = \begin{pmatrix} 1 & 0 & 1 \\ 0 & 0 & 0 \\ 1 & 0 & 1 \end{pmatrix}$$

(a) Is A Hermitian?

(b) Find the eigenvalues and corresponding eigenvectors.

(c) What is unusual about the eigenvectors corresponding to the eigenvalue c = 0?

7.3 Suppose that an n × n matrix A is diagonal, so that $A_{ij} = 0$ for $i \neq j$, but the diagonal
elements $A_{11}, A_{22}, \ldots$ need not be zero. Assume that $A_{11} \neq A_{22} \neq \cdots \neq A_{nn}$. Find the
eigenvalues and eigenvectors of this matrix.

7.4 The trace of a matrix A, written tr(A), is defined to be the sum of its diagonal elements:

$$\mathrm{tr}(A) = \sum_{i=1}^{n} A_{ii}$$

(a) Show that for any two square matrices, tr(AB) = tr(BA).

(b) Show that for any matrix A, the trace is equal to the sum of its eigenvalues (where
multiple eigenvalues must be included in the sum multiple times).

7.5 Normalize these vectors: 

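7.6 A particle is in the state $|\phi\rangle$. Let $|\psi_1\rangle, |\psi_2\rangle, \ldots, |\psi_n\rangle$ be an orthonormal basis for
the vector space which contains $|\phi\rangle$, and let $Q$ be a Hermitian operator. Show that

$$\langle Q^2\rangle = \sum_{j=1}^{n} |\langle\phi|Q|\psi_j\rangle|^2$$

7.7 Suppose that $|\psi_1\rangle, |\psi_2\rangle, \ldots, |\psi_n\rangle$ is an orthonormal basis set, and all of the basis
vectors are eigenvectors of the operator $Q$ with $Q|\psi_j\rangle = q_j|\psi_j\rangle$ for all j. A particle
is in the state $|\phi\rangle$. Show that for this particle, the expectation value of $Q$ is

$$\langle Q\rangle = \sum_{j=1}^{n} q_j\,|\langle\psi_j|\phi\rangle|^2$$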







7.8 If the operator $U$ has the property that $U^\dagger U = I$ (where $I$ is the identity operator),
then $U$ is called a unitary operator. Show that if $|\psi_1\rangle, |\psi_2\rangle, \ldots, |\psi_n\rangle$ are a set of
orthonormal vectors, then $U|\psi_1\rangle, U|\psi_2\rangle, \ldots, U|\psi_n\rangle$ are also a set of orthonormal
vectors.

7.9 Suppose that $U$ is a unitary operator, as defined in Exercise 7.8, and $U$ is represented
by a matrix. Show that the columns of $U$ form a set of orthonormal column vectors.

7.10 A particle is in the state $|\phi\rangle$. Let $|\psi_1\rangle, |\psi_2\rangle, \ldots, |\psi_n\rangle$ be an orthonormal basis for the
vector space which contains $|\phi\rangle$, and assume that all of these basis vectors are
eigenvectors of $H$ with $H|\psi_j\rangle = E_j|\psi_j\rangle$ for all j. Suppose that the operator $Q$ satisfies

$$\langle Q\rangle = \sum_m E_m\,|\langle\phi|Z|\psi_m\rangle|^2$$

where $Z$ is just the usual position operator in the z direction. Derive an expression for
$Q$ as a function of $H$ and $Z$, but not containing $E_m$.

7.11 The delta function is sometimes represented as

$$\delta(x) = \frac{1}{2\pi}\int_{-\infty}^{\infty} e^{ikx}\,dk$$

Show that this definition satisfies the properties of the delta function given in Equations
(7.10)-(7.12).

7.12 Show that

$$\delta(ax) = \frac{\delta(x)}{|a|}$$

7.13 Show that

$$\int f(x)\,\delta(g(x))\,dx = \frac{f(x_0)}{|dg/dx|_{x=x_0}}$$

where $x_0$ is determined by $g(x_0) = 0$.





CHAPTER

8


Spin Angular Momentum


In Chapter 6 we noted that there are two types of angular momentum most relevant 
in quantum mechanics: orbital angular momentum, produced at the classical level 
by the physical motion of a particle, and spin angular momentum, which represents 
a type of angular momentum internal to the particle. Here we will examine the 
latter in more detail. Although it is tempting to think of “spin” as being produced 
by the actual internal rotation of a particle, this is misleading. It is more accurate 
to treat spin as an intrinsic property of the particle, like charge or mass. 


8.1 ■ SPIN OPERATORS 

All of the formalism derived for angular momentum operators in Chapter 6 carries
over to spin operators. In particular, the operators $S_x$, $S_y$, and $S_z$ give the
components of spin angular momentum in the x, y, and z directions, respectively, while
$S^2 = S_x^2 + S_y^2 + S_z^2$ is the operator corresponding to the square of the total
angular momentum. Further, these operators satisfy the standard angular momentum
commutation relations derived in Chapter 6:

$$[S_x, S_y] = i\hbar S_z \tag{8.1}$$

$$[S_z, S_x] = i\hbar S_y \tag{8.2}$$

$$[S_y, S_z] = i\hbar S_x \tag{8.3}$$

As in the case of orbital angular momentum, it is impossible to measure all three
components of spin simultaneously, but it is possible to measure a single
component of $\mathbf{S}$ and the total squared angular momentum $S^2$. As before, we will normally
make the somewhat arbitrary decision to measure the z component of angular
momentum. Then a particle which is in an eigenstate of the operators $S^2$ and $S_z$ has
a wave function $|\psi\rangle$ satisfying

$$S^2|\psi\rangle = \hbar^2 s(s+1)|\psi\rangle$$

$$S_z|\psi\rangle = \hbar m_s|\psi\rangle \tag{8.4}$$


However, there are also significant differences between spin angular momentum
and orbital angular momentum. First, recall that the eigenfunctions of orbital
angular momentum could be written as spatial wave functions, namely the
spherical harmonics $Y_l^m(\theta, \phi)$. There are no spatial wave functions corresponding to
eigenstates of spin, since spin is a purely internal property of a particle. Second,
the total spin eigenvalue $s$ has a fixed value for any given particle, and, unlike
orbital angular momentum, it cannot be increased or decreased for a single particle.
For example, the electron in a hydrogen atom can attain arbitrarily large values
for $l$ (for arbitrarily large values of $n$). However, its spin is always $s = 1/2$. This
is one reason that viewing spin as the physical rotation of a particle is misleading;
an elementary particle cannot be “spun up” to obtain larger values of spin. Finally,
we saw in Chapter 6 that the orbital angular momentum quantum number was
restricted to integer values. This is not true for spin; the spin quantum number can take
on the full range of both integer and half-integer values: $s = 0, 1/2, 1, 3/2, \ldots$

All elementary particles in nature have an intrinsic spin, given by the $s$ quantum
number of Equation (8.4). In fact, spin is such a fundamental property that particles
are classified according to their spin: particles with integer spin ($s = 0, 1, 2, \ldots$)
are called bosons, while particles with half-integer spin ($s = 1/2, 3/2, \ldots$) are
called fermions. The properties of fermions and bosons in multi-particle
collections are fundamentally different, as we will see in Chapter 13. All of the particles
making up the ordinary matter in the universe, i.e., protons, neutrons, and
electrons, have $s = 1/2$ and are therefore fermions. Bosons include the photon ($s = 1$),
the pion ($s = 0$), and the graviton ($s = 2$), which is hypothesized to transmit the
gravitational force.


8.2 ■ EVIDENCE FOR SPIN 


What is the reason for believing that spin angular momentum even exists? Today, 
the entire edifice of particle physics is built on the idea that particles have spin, so 
there are countless experiments confirming it. But there were two main pieces of 
experimental evidence available to the pioneers of quantum mechanics, and both 
are based on the same mechanism: the link between angular momentum and the 
production of magnetic fields. 

Consider first the case of orbital angular momentum, and recall how magnetic
fields are generated by a classical charged particle moving in a circular trajectory. A
classical particle with charge $q$, moving in a circular orbit of radius $r$ with velocity
of magnitude $v$ (Figure 8.1), produces the current

$$I = \frac{qv}{2\pi r}$$

and the magnetic moment is

$$\mu = IA = \frac{qv}{2\pi r}\,\pi r^2 = \frac{qvr}{2}$$

FIGURE 8.1 A particle of charge $q$ moving in a circular orbit of radius $r$ at velocity of
magnitude $v$ produces a magnetic moment $\mu = qvr/2$.

Further, the classical angular momentum is

$$L = mvr$$

so that the relation between the magnetic moment and the angular momentum for
a classical charged particle in a circular orbit is

$$\mu = \frac{q}{2m}\,L$$

For an electron with charge $q = -e$, this expression gives

$$\mu = -\frac{e}{2m_e}\,L \tag{8.5}$$


It is conventional to write the magnetic moment in terms of the Bohr magneton,
which is defined by

$$\mu_B = \frac{e\hbar}{2m_e} = 9.3 \times 10^{-24}\ \mathrm{A\,m^2}$$

Then the expression for the magnetic moment of an electron can be rewritten as

$$\mu = -\mu_B\,\frac{L}{\hbar}$$

Note that $L$ and $\hbar$ have the same units, and $\mu_B$ has units of magnetic moment.
Now recall that both the angular momentum and the magnetic moment are vectors.
Further, we insert an extra factor $g_l$ into the expression for the magnetic moment,
setting $g_l = 1$. This gives

$$\boldsymbol{\mu}_L = -g_l\,\mu_B\,\frac{\mathbf{L}}{\hbar} \tag{8.6}$$
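
The numerical value of the Bohr magneton is easy to check. The sketch below assumes
the CODATA 2018 values of $e$, $\hbar$, and $m_e$ in SI units (these constants are our
input, since the text quotes only the rounded result), and it assumes that the z
component of the orbital moment takes the form $-g_l \mu_B m_l$ for a state with
$L_z = \hbar m_l$. The function names are hypothetical.

```python
# CODATA 2018 values in SI units (assumed inputs).
E_CHARGE = 1.602176634e-19      # elementary charge, C
HBAR = 1.054571817e-34          # reduced Planck constant, J s
M_ELECTRON = 9.1093837015e-31   # electron mass, kg

def bohr_magneton():
    """mu_B = e * hbar / (2 * m_e), in A m^2 (equivalently J/T)."""
    return E_CHARGE * HBAR / (2 * M_ELECTRON)

def orbital_moment_z(m_l, g_l=1.0):
    """z component of the orbital magnetic moment for L_z = hbar * m_l,
    assuming mu_z = -g_l * mu_B * m_l."""
    return -g_l * bohr_magneton() * m_l

print(bohr_magneton())       # close to the 9.3e-24 A m^2 quoted above
print(orbital_moment_z(1))   # electron with m_l = +1: moment points along -z
```

The negative sign for positive $m_l$ reflects the negative charge of the electron: its
magnetic moment points opposite to its angular momentum.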







Why did we insert this extra factor $g_l$, only to set it to one? The reason is that when
we generalize our result to include spin angular momentum, we will have another
factor $g_s$, but it will not necessarily be equal to one. Using $g_l$ in Equation (8.6)
preserves the parallel between the orbital magnetic moment and the spin magnetic
moment.

Arguing from analogy to Equation (8.6), we now postulate that the spin angular
momentum will also generate a magnetic field with magnetic moment of the form

$$\boldsymbol{\mu}_S = -g_s\,\mu_B\,\frac{\mathbf{S}}{\hbar} \tag{8.7}$$

It is observed experimentally that Equation (8.7) does, in fact, apply to the electron,
but now $g_s \neq 1$. The value of $g_s$ for the electron is, in fact, one of the best measured
quantities in nature (S. Eidelman, et al., Physics Letters B592, 1, 2004):

$$g_s = 2.0023193043718 \pm 0.0000000000076$$

This measured value for $g_s$ leads to some obvious questions: Why is it almost
exactly equal to 2? And why is it not exactly equal to 2? It turns out that the
answers to both of these questions are very significant. The answer to the first
question can be derived from the Dirac equation, which is the basic equation of
relativistic quantum mechanics (Chapter 15). The Dirac equation predicts that $g_s$
should be exactly equal to 2. The small deviation from 2 is a real effect, but it was
not explained until the invention of quantum field theory by Richard Feynman and
others in the 1940s. This small deviation from 2 is called the anomalous magnetic
moment of the electron, and its calculation is far beyond the scope of this book.
For most practical purposes, it is sufficient to take $g_s = 2$.

These derivations of $\boldsymbol{\mu}_L$ and $\boldsymbol{\mu}_S$ are based, for $\boldsymbol{\mu}_L$, on purely classical arguments,
and for $\boldsymbol{\mu}_S$, on analogy with the $\boldsymbol{\mu}_L$ derivation, so why should they be believed?
The evidence, as always, is based on experiment. The first piece of evidence comes
from the energy levels of the atom. Our derivation of the hydrogen energy levels
in Chapter 6 is incomplete. In the spectrum of hydrogen, for example,
experimental evidence shows that certain of the degenerate energy levels are not, in fact,
completely degenerate but are separated by a tiny amount in energy. This can be
explained by the interaction between the spin magnetic moment of the electron and
its orbital magnetic moment; this interaction splits the degenerate levels apart in
energy. This calculation will be performed in detail in Chapter 9 after the necessary
mathematical machinery has been developed. Under the assumption that the orbital
and spin magnetic moments are given by Equations (8.6) and (8.7), respectively,
this splitting can be predicted correctly. Obviously, this mechanism will work only
if the electron has both an orbital magnetic moment and a spin magnetic moment.
The former will be produced by any charged particle with orbital angular
momentum, but the latter makes sense only if the electron has its own intrinsic angular
momentum. This explanation was first proposed by Goudsmit and Uhlenbeck in
1925, and it is now known to be correct.




FIGURE 8.2 The classical bar magnet on the left will be attracted upward in the
inhomogeneous magnetic field, while the magnet on the right will be repelled downward.


A “cleaner” piece of evidence for the spin magnetic moment of the electron (and, 
therefore, for the spin of the electron) comes from the Stern-Gerlach experiment. 
Consider what happens to a classical bar magnet placed in an inhomogeneous 
magnetic field, like the one shown in Figure 8.2. In this figure, the field strength 
increases in the upward direction. The small magnet on the left will be attracted 
upward, since the field strength at the S pole, which pulls the magnet up, is larger 
than the field strength at the N pole, which pulls the magnet down. The magnet on 
the right is repelled in both directions, and the same argument shows that it will 
be pushed downward. 

Now consider an ideal dipole $\boldsymbol{\mu}$ in an inhomogeneous field. The force exerted
on the dipole is $\mathbf{F} = -\nabla V$, where $V$ is the potential energy of the dipole in the
magnetic field. Assume for simplicity that $\mathbf{B}$ is in the z direction, so



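The potential energy of a magnetic dipole $\boldsymbol{\mu}$ in a magnetic field $\mathbf{B}$ is

$$V = -\boldsymbol{\mu} \cdot \mathbf{B}$$

Therefore, the force on the dipole can be written as

$$\mathbf{F} = \frac{\partial B_z}{\partial z}\,\mu_z\,\hat{\mathbf{z}} \tag{8.8}$$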

Imagine that we shoot a beam of atoms into the page through the magnetic field
shown in Figure 8.2 and measure their deflection on a screen behind the magnetic
field. Equation (8.8) says that the force should be proportional to the z component of
the magnetic moment of the atom. In the absence of spin, Equation (8.6) indicates
that this z component should be

$$\mu_z = -g_l\,\mu_B\,\frac{L_z}{\hbar}$$

So in the absence of spin, the force, and therefore the deflection of each atom,
should depend on the z component of the orbital angular momentum. If the atom
is in an eigenstate of $L_z$ with quantum numbers $l$ and $m_l$, then

$$L_z|l\,m_l\rangle = \hbar m_l|l\,m_l\rangle$$

so the deflection should depend on $m_l$. Since $m_l$ can range from $-l$ to $+l$, we
expect that the atoms would be split into $2l + 1$ discrete beams.

Stern and Gerlach actually performed this experiment with silver atoms in 1922
and discovered that the beam formed two bands on the screen behind the magnet;
in other words, the inhomogeneous magnetic field split the beam into exactly two
separate beams. This experiment was repeated using hydrogen atoms by Phipps
and Taylor in 1927. Here the experiment is even clearer in its predictions: hydrogen
in its ground state has $l = 0$ and $m_l = 0$, so if orbital angular momentum is the only
source of magnetic moments in the hydrogen atom, there should be no deflection
in the inhomogeneous magnetic field at all. However, Phipps and Taylor got
the same answer: the beam of hydrogen atoms was split into two bands.

This result can be explained if the electron has spin angular momentum. Assume
that the electron in the hydrogen atom has spin $s$. Then the z component of the
spin magnetic moment is

$$\mu_z = -g_s\,\mu_B\,\frac{S_z}{\hbar}$$

with

$$S_z|s\,m_s\rangle = \hbar m_s|s\,m_s\rangle$$

To get splitting into two beams requires that $m_s$ take on exactly two possible values,
namely, $m_s = \pm 1/2$. This will be true if the electron has $s = 1/2$.


8.3 ■ ADDING ANGULAR MOMENTUM 

The electrons in an atom have both orbital and spin angular momentum. Even in 
the simplest atom, hydrogen, both types of angular momentum will be present. It 
is important, therefore, to understand how to calculate the total angular momen¬ 
tum, including both the orbital and spin components. Classically, the addition of 
angular momentum is straightforward: it corresponds to a vector sum of the two 
individual components of angular momentum. In quantum mechanics, things are 
more complicated. We will define the total angular momentum operator J as the 
sum of the L and S operators: 


J = L + S 


(8.9) 




An eigenstate of total angular momentum and of the z component of total angular
momentum is written as $|j\,m_j\rangle$ with

$$J^2|j\,m_j\rangle = \hbar^2 j(j+1)|j\,m_j\rangle$$

$$J_z|j\,m_j\rangle = \hbar m_j|j\,m_j\rangle \tag{8.10}$$


Suppose that we have a state with quantum numbers $l$, $m_l$, $s$, $m_s$, and we would
like to calculate the possible values of $j$ and $m_j$. Clearly, $J^2 \neq L^2 + S^2$, so there
is not a simple relationship between $j$, $l$, and $s$. However, it is true from Equation
(8.9) that

$$J_z = L_z + S_z$$

Thus, if $|m_l\,m_s\rangle$ is an eigenstate of both $S_z$ and $L_z$ such that

$$L_z|m_l\,m_s\rangle = \hbar m_l|m_l\,m_s\rangle$$

$$S_z|m_l\,m_s\rangle = \hbar m_s|m_l\,m_s\rangle$$

then

$$J_z|m_l\,m_s\rangle = (L_z + S_z)|m_l\,m_s\rangle = \hbar(m_l + m_s)|m_l\,m_s\rangle$$

A comparison of this equation with Equation (8.10) shows that the z-component
quantum numbers are additive:

$$m_j = m_l + m_s \tag{8.11}$$

This result also provides a clue as to the possible values of $j$. Since $m_l \le l$ and
$m_s \le s$, Equation (8.11) implies

$$m_j \le l + s \tag{8.12}$$

Since $m_j \le j$, Equation (8.12) will automatically be satisfied as long as

$$j \le l + s$$

This quantum mechanical relation agrees with the upper bound on the classical
value of $|\mathbf{J}|$: classically, the largest possible value of $|\mathbf{J}|$ occurs when $\mathbf{L}$ and $\mathbf{S}$
are parallel, so that $|\mathbf{J}| = |\mathbf{L}| + |\mathbf{S}|$. The smallest possible value of $|\mathbf{J}|$ in the
classical case occurs when $\mathbf{L}$ and $\mathbf{S}$ are pointing in opposite directions, so that
$|\mathbf{J}| = \big||\mathbf{L}| - |\mathbf{S}|\big|$. This suggests the quantum analog

$$j \ge |l - s|$$

Note further that $j$ can vary between $|l - s|$ and $l + s$ only in integer steps, not
half-integer steps. The reason for this can be derived from Equation (8.11). Suppose that
$j$ and $m_j$ both have their maximum values, $j = l + s$ and $m_j = m_l + m_s$. Since
$m_l$ and $m_s$ vary by integer steps between their maximum and minimum values, the
next largest value of $m_j$ possible is $m_j = m_l + m_s - 1$. However, if $j$ could vary
by half-integer steps, we would expect to see $m_j = m_l + m_s - 1/2$, which does
not exist.

To summarize, the possible values of $j$ are

$$j = |l - s|,\ |l - s| + 1,\ \ldots,\ l + s - 1,\ l + s \tag{8.13}$$


Example 8.1. Adding Angular Momenta in the Hydrogen Atom.

The electron in a hydrogen atom is in an $l = 1$ state. What are the possible values
of $j$ and $m_j$?

The electron has $s = 1/2$ and $l = 1$, so Equation (8.13) gives $j = 1/2$ or $j =
3/2$. In the usual way, $m_j$ can vary from $-j$ to $+j$ in integer steps, so the possible
values of $j$ and $m_j$ are

$$j = 1/2, \quad m_j = -1/2,\ +1/2$$

and

$$j = 3/2, \quad m_j = -3/2,\ -1/2,\ +1/2,\ +3/2$$

In general, if the electron in a hydrogen atom has quantum number $l$, then the
possible values for $j$ are $j = l - 1/2$ and $j = l + 1/2$. The exception to this is the
case $l = 0$, for which $j = 1/2$ is the only possible state.

Nothing in this derivation is peculiar to orbital and spin angular momenta; these
arguments apply to the addition of any angular momenta. For instance, for two
particles with spins $s_1$ and $s_2$, the possible values for the total spin quantum number
$s$ are

$$s = |s_1 - s_2|,\ |s_1 - s_2| + 1,\ \ldots,\ s_1 + s_2 - 1,\ s_1 + s_2$$

A frequently encountered situation is that of two particles each with spin 1/2, such
as the electron and proton in a hydrogen atom. In this case, the possible total spin
states are $s = 1$ and $s = 0$. The $s = 1$ state is called the triplet state because it has
three possible values for $m_s$ (i.e., $-1$, $0$, and $+1$), while the $s = 0$ state is called
the singlet state because it can have only $m_s = 0$.
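
The counting rules for adding angular momenta lend themselves to a short
enumeration. The following is a minimal sketch, assuming only the rule that the total
quantum number runs from the absolute difference to the sum in integer steps and that
the z-component quantum number runs from $-j$ to $+j$ in integer steps. Exact fractions
are used so that half-integer values are represented without rounding; the function
names `allowed_j` and `allowed_m` are ours.

```python
from fractions import Fraction

def allowed_j(l, s):
    """All values from |l - s| to l + s in integer steps."""
    l, s = Fraction(l), Fraction(s)
    j = abs(l - s)
    values = []
    while j <= l + s:
        values.append(j)
        j += 1
    return values

def allowed_m(j):
    """All z-component quantum numbers from -j to +j in integer steps."""
    j = Fraction(j)
    return [-j + k for k in range(int(2 * j) + 1)]

# Example 8.1: an electron (s = 1/2) in an l = 1 state.
for j in allowed_j(1, Fraction(1, 2)):
    print("j =", j, " m_j =", [str(m) for m in allowed_m(j)])

# Two spin-1/2 particles: singlet (s = 0) and triplet (s = 1).
for s in allowed_j(Fraction(1, 2), Fraction(1, 2)):
    print("s =", s, " m_s =", [str(m) for m in allowed_m(s)])
```

A useful consistency check is that the total number of states is the same in either
description: summing $2j + 1$ over the allowed values of $j$ gives $(2l + 1)(2s + 1)$.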

8.4 ■ THE MATRIX REPRESENTATION OF SPIN

In this section we will develop a mathematical representation of spin states. Un¬ 
like orbital angular momentum, the spin states cannot be written as functions of 
position. Instead, they can be represented as column vectors. 





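FIGURE 8.3 For a particle with $s = 1/2$, the state $m_s = -1/2$ is spin down, and $m_s =
+1/2$ is spin up.

Consider first a particle, such as the electron, which has $s = 1/2$. It has two
possible values for $m_s$, which are $m_s = -1/2$ and $m_s = +1/2$. Since $m_s$ represents
the z component of spin, it is convenient to think of these states as two different
orientations of the angular momentum vector, namely, spin “down” and spin “up”
(Figure 8.3). These states can be represented in Dirac notation as $|\downarrow\rangle$ for spin
down and $|\uparrow\rangle$ for spin up. Thus, we have

$$S_z|\downarrow\rangle = -\frac{\hbar}{2}|\downarrow\rangle$$

$$S_z|\uparrow\rangle = +\frac{\hbar}{2}|\uparrow\rangle$$

There are only a finite number of these states (i.e., two). This suggests that they can
be represented in terms of a two-dimensional vector space consisting of column
vectors. Thus, the spin up state is written as

$$|\uparrow\rangle \Leftrightarrow \begin{pmatrix} 1 \\ 0 \end{pmatrix}$$

and the spin down state as

$$|\downarrow\rangle \Leftrightarrow \begin{pmatrix} 0 \\ 1 \end{pmatrix}$$

These two vectors form an orthonormal basis set, since

$$\langle\uparrow|\uparrow\rangle = \begin{pmatrix} 1 & 0 \end{pmatrix}\begin{pmatrix} 1 \\ 0 \end{pmatrix} = 1$$

$$\langle\downarrow|\downarrow\rangle = \begin{pmatrix} 0 & 1 \end{pmatrix}\begin{pmatrix} 0 \\ 1 \end{pmatrix} = 1$$

$$\langle\uparrow|\downarrow\rangle = \begin{pmatrix} 1 & 0 \end{pmatrix}\begin{pmatrix} 0 \\ 1 \end{pmatrix} = 0$$

$$\langle\downarrow|\uparrow\rangle = \begin{pmatrix} 0 & 1 \end{pmatrix}\begin{pmatrix} 1 \\ 0 \end{pmatrix} = 0$$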





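Although the spin 1/2 case will be investigated in the most detail, this matrix
representation can be extended to other spin states. For example, for $s = 1$ there
are three spin vectors, namely,

$$|s = 1,\ m_s = 1\rangle \Leftrightarrow \begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}$$

$$|s = 1,\ m_s = 0\rangle \Leftrightarrow \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix}$$

$$|s = 1,\ m_s = -1\rangle \Leftrightarrow \begin{pmatrix} 0 \\ 0 \\ 1 \end{pmatrix}$$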



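Now consider the spin operators $S_x$, $S_y$, and $S_z$. From the discussion in Chapter
7, these should be 2 × 2 matrices, which we will now calculate. Consider an
arbitrary 2 × 2 matrix $A$. It is possible to pick out the individual elements of the
matrix by the following sorts of matrix multiplications:

$$\begin{pmatrix} 1 & 0 \end{pmatrix}\begin{pmatrix} A_{11} & A_{12} \\ A_{21} & A_{22} \end{pmatrix}\begin{pmatrix} 1 \\ 0 \end{pmatrix} = A_{11}, \qquad
\begin{pmatrix} 1 & 0 \end{pmatrix}\begin{pmatrix} A_{11} & A_{12} \\ A_{21} & A_{22} \end{pmatrix}\begin{pmatrix} 0 \\ 1 \end{pmatrix} = A_{12}$$

and so forth. In general, for a matrix of arbitrary size, the element $A_{ij}$ can be picked
out by multiplying on the left side with a row vector having 1 in the $i$th entry and
0’s everywhere else, and on the right by a column vector having 1 in the $j$th entry
and 0’s everywhere else:

$$A_{ij} = \begin{pmatrix} 0 & \cdots & 1 & \cdots & 0 \end{pmatrix} A \begin{pmatrix} 0 \\ \vdots \\ 1 \\ \vdots \\ 0 \end{pmatrix}$$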



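Since the basis vectors $|\downarrow\rangle$ and $|\uparrow\rangle$ are eigenvectors of the operator $S_z$, it is
straightforward to use this method to calculate the matrix elements of $S_z$:

$$S_{z11} = \langle\uparrow|S_z|\uparrow\rangle = \langle\uparrow|\frac{\hbar}{2}|\uparrow\rangle = \frac{\hbar}{2}\langle\uparrow|\uparrow\rangle = \frac{\hbar}{2}$$

$$S_{z12} = \langle\uparrow|S_z|\downarrow\rangle = \langle\uparrow|{-\frac{\hbar}{2}}|\downarrow\rangle = -\frac{\hbar}{2}\langle\uparrow|\downarrow\rangle = 0$$

$$S_{z21} = \langle\downarrow|S_z|\uparrow\rangle = \langle\downarrow|\frac{\hbar}{2}|\uparrow\rangle = \frac{\hbar}{2}\langle\downarrow|\uparrow\rangle = 0$$

$$S_{z22} = \langle\downarrow|S_z|\downarrow\rangle = \langle\downarrow|{-\frac{\hbar}{2}}|\downarrow\rangle = -\frac{\hbar}{2}\langle\downarrow|\downarrow\rangle = -\frac{\hbar}{2}$$

Thus, $S_z$ is given by the matrix

$$S_z = \begin{pmatrix} \hbar/2 & 0 \\ 0 & -\hbar/2 \end{pmatrix}$$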




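Example 8.2. Using the Matrix Representation of $S_z$.

Here we verify that $S_z$ gives the correct result when operating on $|\downarrow\rangle$ and $|\uparrow\rangle$.
The matrix representations for $S_z$, $|\downarrow\rangle$, and $|\uparrow\rangle$ give

$$S_z|\uparrow\rangle = \begin{pmatrix} \hbar/2 & 0 \\ 0 & -\hbar/2 \end{pmatrix}\begin{pmatrix} 1 \\ 0 \end{pmatrix} = \frac{\hbar}{2}\begin{pmatrix} 1 \\ 0 \end{pmatrix} = \frac{\hbar}{2}|\uparrow\rangle$$

and

$$S_z|\downarrow\rangle = \begin{pmatrix} \hbar/2 & 0 \\ 0 & -\hbar/2 \end{pmatrix}\begin{pmatrix} 0 \\ 1 \end{pmatrix} = -\frac{\hbar}{2}\begin{pmatrix} 0 \\ 1 \end{pmatrix} = -\frac{\hbar}{2}|\downarrow\rangle$$

so $S_z$ does indeed give the correct result when applied to $|\downarrow\rangle$ and $|\uparrow\rangle$.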
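
The same construction can be carried out numerically. The sketch below is an
illustration only: it assumes units in which $\hbar = 1$, represents spin up and spin
down as the column vectors (1, 0) and (0, 1), builds the matrix for $S_z$ element by
element from the inner products of basis states, and then repeats the check of
Example 8.2. All function and variable names are ours.

```python
HBAR = 1.0  # units with hbar = 1 (an assumption made for convenience)

UP = [1, 0]     # spin up as a column vector
DOWN = [0, 1]   # spin down as a column vector
BASIS = [UP, DOWN]

def sz_on_basis(ket):
    """Action of S_z on a basis ket, using its known eigenvalues +hbar/2 and -hbar/2."""
    eigenvalue = HBAR / 2 if ket == UP else -HBAR / 2
    return [eigenvalue * component for component in ket]

def inner(bra_vec, ket):
    """Inner product for these real basis vectors (no conjugation needed)."""
    return sum(a * b for a, b in zip(bra_vec, ket))

# Matrix element (i, j) is <basis_i| S_z |basis_j>.
S_Z = [[inner(BASIS[i], sz_on_basis(BASIS[j])) for j in range(2)] for i in range(2)]

def apply_matrix(matrix, ket):
    """Matrix times column vector."""
    return [sum(matrix[r][c] * ket[c] for c in range(len(ket))) for r in range(len(matrix))]

print(S_Z)
print(apply_matrix(S_Z, UP))    # (hbar/2) times spin up
print(apply_matrix(S_Z, DOWN))  # -(hbar/2) times spin down
```

Because the basis vectors are eigenvectors of $S_z$, the resulting matrix is diagonal,
with the eigenvalues on the diagonal.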


Obtaining matrix representations for $S_x$ and $S_y$ is a bit more complicated. To
do this, we make use of the spin ladder operators, which are the analogs of the
operators defined in Chapter 6 for orbital angular momentum:

$$S_+ = S_x + iS_y$$

$$S_- = S_x - iS_y$$

As in Chapter 6, these operators raise and lower the values of $m_s$, and they give 0
when attempting to raise $m_s$ above its highest allowed value or attempting to lower
$m_s$ below its lowest allowed value. Thus,

$$S_-|s\,m_s\rangle \propto |s\,m_s - 1\rangle \tag{8.14}$$

$$S_+|s\,m_s\rangle \propto |s\,m_s + 1\rangle \tag{8.15}$$

The constants of proportionality in Equations (8.14) and (8.15) need to be
determined. Note that $S_- = S_+^\dagger$ and $S_+ = S_-^\dagger$, so if

$$S_-|s\,m_s\rangle = c\,|s\,m_s - 1\rangle \tag{8.16}$$

where $c$ is a constant to be determined, then the dual of this equation is

$$\langle s\,m_s|S_+ = \langle s\,m_s - 1|\,c^* \tag{8.17}$$

and taking the inner product of the quantities in Equations (8.16) and (8.17) gives

$$\langle s\,m_s|S_+S_-|s\,m_s\rangle = c^*c\,\langle s\,m_s - 1|s\,m_s - 1\rangle = |c|^2 \tag{8.18}$$



In order to determine $c$, we must express $S_+S_-$ in terms of operators whose
behavior, when applied to $|s\,m_s\rangle$, is known. Note that

$$\begin{aligned}
S_+S_- &= (S_x + iS_y)(S_x - iS_y) \\
&= S_x^2 + S_y^2 - i[S_x, S_y] \\
&= S_x^2 + S_y^2 - i(i\hbar S_z) \\
&= S^2 - S_z^2 + \hbar S_z
\end{aligned}$$

and we know how $S^2$ and $S_z$ operate on $|s\,m_s\rangle$. Thus Equation (8.18) can be written
as

$$\langle s\,m_s|S^2 - S_z^2 + \hbar S_z|s\,m_s\rangle = |c|^2$$

which gives

$$|c|^2 = \hbar^2\left[s(s+1) - m_s^2 + m_s\right]$$

Hence, Equation (8.16) can be written as

$$S_-|s\,m_s\rangle = \hbar\sqrt{s(s+1) - m_s(m_s - 1)}\;|s\,m_s - 1\rangle$$

A similar argument gives

$$S_+|s\,m_s\rangle = \hbar\sqrt{s(s+1) - m_s(m_s + 1)}\;|s\,m_s + 1\rangle \tag{8.19}$$

For the special case of $s = 1/2$, we get $S_-|\uparrow\rangle = \hbar|\downarrow\rangle$ and $S_+|\downarrow\rangle = \hbar|\uparrow\rangle$.
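
The ladder-operator coefficients are simple to tabulate. The sketch below assumes the
square-root form of the coefficients, $\sqrt{s(s+1) - m_s(m_s \mp 1)}$ in units of
$\hbar$, and evaluates it for spin 1/2 and spin 1. The function names are ours, and
exact fractions are used for the quantum numbers so that the vanishing coefficients
come out exactly zero.

```python
from fractions import Fraction
import math

def lowering_coefficient(s, m):
    """Coefficient multiplying the state with m_s lowered by 1, in units of hbar."""
    s, m = Fraction(s), Fraction(m)
    return math.sqrt(s * (s + 1) - m * (m - 1))

def raising_coefficient(s, m):
    """Coefficient multiplying the state with m_s raised by 1, in units of hbar."""
    s, m = Fraction(s), Fraction(m)
    return math.sqrt(s * (s + 1) - m * (m + 1))

half = Fraction(1, 2)
print(lowering_coefficient(half, half))    # lowering spin up gives hbar times spin down
print(raising_coefficient(half, -half))    # raising spin down gives hbar times spin up
print(raising_coefficient(half, half))     # raising the top state gives zero
print(lowering_coefficient(half, -half))   # lowering the bottom state gives zero
print(lowering_coefficient(1, 1), lowering_coefficient(1, 0))  # spin 1
```

The coefficients vanish automatically at the top and bottom of the ladder, which is
how the formula encodes the requirement that $m_s$ cannot leave the range $-s$ to $+s$.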

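These expressions for $S_+$ and $S_-$ can now be used to determine the matrix
elements for $S_x$ and $S_y$, because we have explicit expressions for the way in which
$S_+$ and $S_-$ operate on $|\downarrow\rangle$ and $|\uparrow\rangle$, and $S_x$ and $S_y$ can be written as

$$S_x = \frac{1}{2}(S_+ + S_-) \tag{8.20}$$

$$S_y = \frac{1}{2i}(S_+ - S_-) \tag{8.21}$$

The matrix elements of $S_+$ are

$$S_{+11} = \langle\uparrow|S_+|\uparrow\rangle = 0 \quad (\text{because } S_+|\uparrow\rangle = 0)$$

$$S_{+12} = \langle\uparrow|S_+|\downarrow\rangle = \langle\uparrow|\hbar|\uparrow\rangle = \hbar\langle\uparrow|\uparrow\rangle = \hbar$$

$$S_{+21} = \langle\downarrow|S_+|\uparrow\rangle = 0 \quad (\text{because } S_+|\uparrow\rangle = 0)$$

$$S_{+22} = \langle\downarrow|S_+|\downarrow\rangle = \langle\downarrow|\hbar|\uparrow\rangle = 0 \quad (\text{because } \langle\downarrow|\uparrow\rangle = 0)$$

Similarly, for the matrix representing $S_-$,

$$S_{-21} = \hbar$$

and all of the other matrix elements are 0. Then $S_+$ and $S_-$ are

$$S_+ = \begin{pmatrix} 0 & \hbar \\ 0 & 0 \end{pmatrix}, \qquad S_- = \begin{pmatrix} 0 & 0 \\ \hbar & 0 \end{pmatrix}$$

Then Equations (8.20) and (8.21) give the matrices for $S_x$ and $S_y$:

$$S_x = \begin{pmatrix} 0 & \hbar/2 \\ \hbar/2 & 0 \end{pmatrix}, \qquad S_y = \begin{pmatrix} 0 & -i\hbar/2 \\ i\hbar/2 & 0 \end{pmatrix}$$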

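It is convenient to factor out $\hbar/2$ from $S_x$, $S_y$, and $S_z$, and to write all of these
matrices in terms of the Pauli spin matrices, $\sigma_x$, $\sigma_y$, and $\sigma_z$:

$$S_x = \frac{\hbar}{2}\sigma_x, \qquad S_y = \frac{\hbar}{2}\sigma_y, \qquad S_z = \frac{\hbar}{2}\sigma_z$$

where $\sigma_x$, $\sigma_y$, and $\sigma_z$ are

$$\sigma_x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad \sigma_y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \qquad \sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$$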




The next step is to determine the eigenvectors and eigenvalues of the spin
operators. Note that we have already done this for $S_z$; the eigenvectors and eigenvalues
for $S_z$ are

$$|\uparrow\rangle = \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \quad \text{eigenvalue} = +\frac{\hbar}{2}$$

$$|\downarrow\rangle = \begin{pmatrix} 0 \\ 1 \end{pmatrix}, \quad \text{eigenvalue} = -\frac{\hbar}{2}$$

Now consider the spin operator in the x direction, $S_x$, and assume an eigenvector
$\begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix}$ with eigenvalue $c$. The eigenvalue equation is

$$\frac{\hbar}{2}\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}\begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix} = c\begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix}$$

This leads to the determinant equation:

$$\begin{vmatrix} -c & \hbar/2 \\ \hbar/2 & -c \end{vmatrix} = 0$$

which gives $c^2 - (\hbar/2)^2 = 0$, so that $c = \pm\hbar/2$. Thus, the eigenvalues of $S_x$ are
identical to the eigenvalues of $S_z$. However, this is exactly what we would expect.
There is no preferred direction for the particle, and the coordinate axes can always
be rotated so that the x-axis points in the z direction, so any quantities measured
along the z-axis should have the same set of possible values if they are measured
along the x-axis (or any other axis, for that matter).
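
The claim that every spin component has the same pair of eigenvalues can be checked
directly. The sketch below assumes units with $\hbar = 1$ and the standard spin-1/2
matrices, and it solves the characteristic equation of a 2 × 2 Hermitian matrix,
$c^2 - (\mathrm{trace})\,c + \det = 0$, with the quadratic formula. The function name is
ours.

```python
import math

HBAR = 1.0  # units with hbar = 1 (an assumption made for convenience)

S_X = [[0, HBAR / 2], [HBAR / 2, 0]]
S_Y = [[0, -1j * HBAR / 2], [1j * HBAR / 2, 0]]
S_Z = [[HBAR / 2, 0], [0, -HBAR / 2]]

def eigenvalues_hermitian_2x2(m):
    """Roots of c^2 - trace*c + det = 0. For a Hermitian matrix the trace and
    determinant are real, so only their real parts are kept."""
    trace = (m[0][0] + m[1][1]).real
    det = (m[0][0] * m[1][1] - m[0][1] * m[1][0]).real
    root = math.sqrt(trace * trace - 4 * det)
    return ((trace - root) / 2, (trace + root) / 2)

for name, matrix in (("S_x", S_X), ("S_y", S_Y), ("S_z", S_Z)):
    print(name, eigenvalues_hermitian_2x2(matrix))
```

Each line of output shows the same two values, $-\hbar/2$ and $+\hbar/2$, in agreement
with the symmetry argument given above.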

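The eigenvector of $S_x$ for $c = +\hbar/2$ is given by

$$\frac{\hbar}{2}\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}\begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix} = \frac{\hbar}{2}\begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix}$$

which yields the two equations

$$(\hbar/2)\psi_2 = (\hbar/2)\psi_1$$

$$(\hbar/2)\psi_1 = (\hbar/2)\psi_2$$

As expected, these two equations are not independent of each other: both are
satisfied as long as $\psi_1 = \psi_2$. Thus, the eigenvector, which we will designate $|\rightarrow\rangle$,
can be any multiple of $\begin{pmatrix} 1 \\ 1 \end{pmatrix}$. However, the eigenvector must be normalized. Writing
$|\rightarrow\rangle = c\begin{pmatrix} 1 \\ 1 \end{pmatrix}$, the normalization requirement is

$$\langle\rightarrow|\rightarrow\rangle = |c|^2\begin{pmatrix} 1 & 1 \end{pmatrix}\begin{pmatrix} 1 \\ 1 \end{pmatrix} = 2|c|^2 = 1$$

so that $c = 1/\sqrt{2}$. Hence, the normalized eigenvector of $S_x$ with spin in the +x
direction is

$$|\rightarrow\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ 1 \end{pmatrix} \tag{8.22}$$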






A similar calculation for the eigenvector corresponding to spin in the −x direction
(eigenvalue $= -\hbar/2$) yields

$$|\leftarrow\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ -1 \end{pmatrix} \tag{8.23}$$

For $S_y$ the eigenvectors are (Exercise 8.4):

$$|\nearrow\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ i \end{pmatrix} \quad (+y \text{ direction}) \tag{8.24}$$

$$|\swarrow\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ -i \end{pmatrix} \quad (-y \text{ direction}) \tag{8.25}$$

Note that the set of eigenvectors of $S_z$, namely, $|\uparrow\rangle = \begin{pmatrix} 1 \\ 0 \end{pmatrix}$ and $|\downarrow\rangle = \begin{pmatrix} 0 \\ 1 \end{pmatrix}$,
form an orthonormal basis set, so any other spin state can be represented as a
sum of $|\uparrow\rangle$ and $|\downarrow\rangle$. From Equations (8.22) and (8.23), $|\rightarrow\rangle$ and $|\leftarrow\rangle$ can be
expressed in terms of $|\uparrow\rangle$ and $|\downarrow\rangle$:

$$|\rightarrow\rangle = \frac{1}{\sqrt{2}}\left(|\uparrow\rangle + |\downarrow\rangle\right)$$

$$|\leftarrow\rangle = \frac{1}{\sqrt{2}}\left(|\uparrow\rangle - |\downarrow\rangle\right)$$

Thus, a particle with a spin in the +x or −x direction is a linear combination of
states with spins in the +z and −z directions. This is one of the counterintuitive
predictions of quantum mechanics with no classical analog. As we will see in the
next section, it leads to some rather strange consequences.

8.5 ■ THE STERN-GERLACH EXPERIMENT

As noted in Section 8.2, the Stern-Gerlach experiment provided some of the first
evidence for the existence of spin angular momentum. In this section we will
use an idealized Stern-Gerlach experiment to examine some of the more bizarre
consequences of quantum mechanics.

Imagine a Stern-Gerlach apparatus with an inhomogeneous magnetic field
oriented in the z direction, so that the apparatus separates particles based on the z
component of their magnetic moment. A beam of individual electrons is passed
through this apparatus, and the beam splits into two parts with $m_s = +1/2$ and
$m_s = -1/2$, respectively (Figure 8.4). This allows us to separate the electrons into
states of pure $|\uparrow\rangle$ and $|\downarrow\rangle$.

Now suppose that the beam containing the electrons in the $|\uparrow\rangle$ state is run
through a second Stern-Gerlach apparatus with the field now aligned in the x
direction (Figure 8.5).






FIGURE 8.4 A Stern-Gerlach apparatus with an inhomogeneous magnetic field in the z
direction will separate a beam of electrons into two states with $m_s = +1/2$ and $m_s = -1/2$.

FIGURE 8.5 A beam of electrons in the $|\uparrow\rangle$ state is run through a Stern-Gerlach
apparatus with an inhomogeneous magnetic field in the x direction.


This beam now splits into two separate beams having spins of $+1/2$ and $-1/2$
in the x direction. This result can be explained from a quantum mechanical point
of view. Since $S_x$ and $S_z$ do not commute, the electrons cannot simultaneously be
in a state of definite $S_z$ and definite $S_x$. The first Stern-Gerlach apparatus forces
the electrons into an eigenstate of $S_z$, namely, $|\uparrow\rangle$. However, this state is a
linear combination of the eigenstates of $S_x$. Therefore, the second Stern-Gerlach
apparatus “sees” a superposition of $|\rightarrow\rangle$ and $|\leftarrow\rangle$ and separates the beam into these
two states.

Given a set of electrons in the state $|\uparrow\rangle$, it is possible to calculate the probability
that a second measurement, such as the one just described, will yield a spin in the
+x or −x direction. In Dirac notation, if a particle is in a particular state $|\psi\rangle$,
and we make a measurement to determine whether or not it is in the state $|\phi\rangle$, the
probability that it will be found to be in the state $|\phi\rangle$ is

\[ P = |\langle\phi|\psi\rangle|^2 \]







FIGURE 8.6 A beam of electrons in the $|\uparrow\rangle$ state is run through a Stern-Gerlach
apparatus with an inhomogeneous magnetic field in the x direction and then another Stern- 
Gerlach apparatus with an inhomogeneous magnetic field in the z direction. 


Example 8.3. Calculating Spin State Probabilities. 

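A particle is initially in the $|\uparrow\rangle$ state, and the spin is measured in the x direction.
What is the probability that the spin is found to be in the +x direction?

The particle is initially in the state $|\uparrow\rangle = \begin{pmatrix} 1 \\ 0 \end{pmatrix}$. We wish to know, when the
spin is measured in the x direction, whether the particle will be found to be in the
state $|\rightarrow\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ 1 \end{pmatrix}$. This probability is

\[ |\langle\rightarrow|\uparrow\rangle|^2 = \left| \frac{1}{\sqrt{2}} \begin{pmatrix} 1 & 1 \end{pmatrix} \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right|^2 = \frac{1}{2} \]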


Now here is where quantum mechanics gives a truly strange result. Suppose
we set up a triple Stern-Gerlach apparatus (Figure 8.6). The first apparatus picks
out the electrons in the $|\uparrow\rangle$ state, the second apparatus splits this beam into the
$|\rightarrow\rangle$ and $|\leftarrow\rangle$ states, and then we run the $|\rightarrow\rangle$ electrons back through a third
Stern-Gerlach apparatus that is identical to the first one: it separates the electrons
on the basis of the z component of their spins. In a classical system, since we
selected out only electrons with spins in the +z direction, the final Stern-Gerlach
apparatus would produce only a single beam of electrons with spins in the +z
direction. However, this is not what happens at all; instead, the beam splits again
into a beam with spin in the +z direction and a beam with spin in the −z direction!





To see what is happening, note that the first Stern-Gerlach apparatus picks
out electrons in the state $|\uparrow\rangle$. The second Stern-Gerlach apparatus separates this
beam into the states $|\rightarrow\rangle$ and $|\leftarrow\rangle$, and we keep only the $|\rightarrow\rangle$ electrons. But $|\rightarrow\rangle$
is a superposition of $|\uparrow\rangle$ and $|\downarrow\rangle$ states, namely, $|\rightarrow\rangle = (1/\sqrt{2})(|\uparrow\rangle + |\downarrow\rangle)$. So
in making this measurement, we have added back in the $|\downarrow\rangle$ component, which
shows up in the third apparatus. The very act of measuring the spin changes the
state of the particle. This is one of the characteristics of quantum mechanics not
present in classical physics: in an experiment of this kind, there is no way to avoid
changing the state of the system through the act of measurement. This idea is
examined in more detail in the discussion of measurement theory in Section 8.8.
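
The three-stage experiment can be checked numerically with the rule that the probability of finding a state $|\psi\rangle$ in a state $|\phi\rangle$ is $|\langle\phi|\psi\rangle|^2$. The following is a minimal Python sketch under stated assumptions, not part of the original text: the names `UP`, `DOWN`, `RIGHT`, `LEFT`, `inner`, and `probability` are illustrative, and spin states are stored as two-component tuples in the usual z basis.

```python
import math

# Spin-1/2 states as (amplitude of spin up, amplitude of spin down) in the z basis.
# These names are illustrative assumptions, not notation from the text.
UP = (1.0, 0.0)                                   # spin in the +z direction
DOWN = (0.0, 1.0)                                 # spin in the -z direction
RIGHT = (1 / math.sqrt(2), 1 / math.sqrt(2))      # spin in the +x direction
LEFT = (1 / math.sqrt(2), -1 / math.sqrt(2))      # spin in the -x direction


def inner(phi, psi):
    """Inner product <phi|psi> of two two-component spin states."""
    return sum(complex(a).conjugate() * b for a, b in zip(phi, psi))


def probability(phi, psi):
    """Probability |<phi|psi>|^2 that a particle in state psi is found in state phi."""
    return abs(inner(phi, psi)) ** 2


# Second apparatus (x direction) acting on the spin-up beam from the first apparatus.
p_right = probability(RIGHT, UP)

# Third apparatus (z direction) acting on the +x beam: both outcomes occur.
p_up_again = probability(UP, RIGHT)
p_down = probability(DOWN, RIGHT)

print(f"P(+x | +z) = {p_right:.3f}")
print(f"P(+z | +x) = {p_up_again:.3f}")
print(f"P(-z | +x) = {p_down:.3f}")
```

The nonzero value of `p_down` is the numerical counterpart of the spin-down component that reappears in the third apparatus.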


8.6 ■ SPIN PRECESSION 

In this section we will see an example of how spin can be incorporated into the
Schrödinger equation. Imagine that we have a classical magnetic dipole $\boldsymbol{\mu}$ in a
magnetic field $\mathbf{B}$ (Figure 8.7). The magnetic field exerts a torque $\boldsymbol{\mu} \times \mathbf{B}$ on the
dipole, which will cause it to line up parallel with the field. But now suppose that
in addition, the dipole has angular momentum (Figure 8.8). A rotating body to
which a torque is applied will precess in a direction perpendicular to the angular
momentum vector.

Of course, there is no way to know if these classical analogies will carry over into
the quantum realm until we solve the Schrödinger equation. Consider an electron
with magnetic moment $\boldsymbol{\mu}$ at rest in an external magnetic field $\mathbf{B}$. Since we are
interested in how the particle spin evolves in time, we will use the time-dependent
Schrödinger equation,

\[ H|\psi\rangle = i\hbar\frac{\partial}{\partial t}|\psi\rangle \tag{8.26} \]



FIGURE 8.7 A classical magnetic dipole $\boldsymbol{\mu}$ in a magnetic field $\mathbf{B}$ experiences a torque
$\boldsymbol{\mu} \times \mathbf{B}$, which tends to align the dipole with the field.



FIGURE 8.8 A classical magnetic dipole with angular momentum will precess in a magnetic
field.





where $|\psi\rangle$ is the Dirac wave function for the particle. This leads to two obvious
questions: What is the form for $H$, and what is the form for $|\psi\rangle$? Since the electron
is at rest, the kinetic energy part of $H$ will be zero, and $H$ will be given purely by
the potential energy, which is

\[ V = -\boldsymbol{\mu}\cdot\mathbf{B} \]

Since we are interested in the evolution of the orientation of the spin of the electron,
$V$ must be expressed in terms of the spin rather than the magnetic moment, using
the relation

\[ \boldsymbol{\mu} = -\frac{g_s\mu_B}{\hbar}\mathbf{S} = -\frac{2\mu_B}{\hbar}\mathbf{S} \]

where we have taken $g_s = 2$ for the electron. Then the potential is

\[ V = \frac{2\mu_B}{\hbar}\mathbf{B}\cdot\mathbf{S} \]

We choose a coordinate system so that $\mathbf{B}$ is pointing in the z direction, so $\mathbf{B}\cdot\mathbf{S} = BS_z$,
and we express $S_z$ as

\[ S_z = \frac{\hbar}{2}\sigma_z \]

so that the potential becomes

\[ V = \mu_B B\sigma_z \]

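Then the Schrödinger equation (Equation 8.26) becomes

\[ \mu_B B\sigma_z|\psi\rangle = i\hbar\frac{\partial}{\partial t}|\psi\rangle \tag{8.27} \]

This equation indicates the form for $|\psi\rangle$: since $\sigma_z$ is a 2 × 2 Pauli spin matrix, $|\psi\rangle$
is just the spin state of the electron written as a two-component column vector

\[ |\psi\rangle = \begin{pmatrix} \psi_+ \\ \psi_- \end{pmatrix} \]

where $\psi_+$ and $\psi_-$ will be functions of time. Then Equation (8.27) takes the form

\[ \mu_B B\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}\begin{pmatrix} \psi_+ \\ \psi_- \end{pmatrix} = i\hbar\frac{\partial}{\partial t}\begin{pmatrix} \psi_+ \\ \psi_- \end{pmatrix} \]

Carrying out the matrix multiplication yields two ordinary differential equations:

\[ \mu_B B\,\psi_+ = i\hbar\frac{d\psi_+}{dt} \]

\[ -\mu_B B\,\psi_- = i\hbar\frac{d\psi_-}{dt} \]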






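The general solutions of these two equations give the time evolution of $\psi_+$ and $\psi_-$:

\[ \psi_+ = A_+ e^{-i(\mu_B B/\hbar)t} \]

\[ \psi_- = A_- e^{i(\mu_B B/\hbar)t} \]

where $A_+$ and $A_-$ are constants to be determined. In matrix form, the solution is
then

\[ |\psi(t)\rangle = \begin{pmatrix} A_+ e^{-i(\mu_B B/\hbar)t} \\ A_- e^{i(\mu_B B/\hbar)t} \end{pmatrix} \tag{8.28} \]

The constants $A_+$ and $A_-$ are determined from the initial conditions. In particular,
if we take $t = 0$ to be the initial time, then

\[ |\psi(t=0)\rangle = \begin{pmatrix} A_+ \\ A_- \end{pmatrix} \]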

For example, suppose that the electron has spin up (i.e., in the +z direction) at
$t = 0$. This corresponds to the state

\[ |\psi(t=0)\rangle = \begin{pmatrix} 1 \\ 0 \end{pmatrix} \]

so that $A_+ = 1$ and $A_- = 0$. Then Equation (8.28) gives the wave function at any
later time $t$:

\[ |\psi(t)\rangle = \begin{pmatrix} e^{-i(\mu_B B/\hbar)t} \\ 0 \end{pmatrix} \]

Although this wave function is a function of time, it represents a state of constant
spin. To see this, note that $P = |\langle\uparrow|\psi(t)\rangle|^2$ gives the probability of finding the
particle in the spin up state. This probability is

\[ P = \left| \begin{pmatrix} 1 & 0 \end{pmatrix}\begin{pmatrix} e^{-i(\mu_B B/\hbar)t} \\ 0 \end{pmatrix} \right|^2 = 1 \]

Thus, the electron starts out in the $|\uparrow\rangle$ state, and it stays there forever. This
is consistent with the classical analog: a classical dipole pointing parallel to a
magnetic field experiences no torque and does not rotate.

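Now consider the more interesting case of an electron with spin initially in the
+x direction. In this case the initial spin state (correctly normalized) is

\[ |\psi(t=0)\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ 1 \end{pmatrix} \]

Then $A_+ = 1/\sqrt{2}$ and $A_- = 1/\sqrt{2}$, so the full time-dependent wave function from
Equation (8.28) is

\[ |\psi(t)\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix} e^{-i(\mu_B B/\hbar)t} \\ e^{i(\mu_B B/\hbar)t} \end{pmatrix} \]

To simplify the equation, define a new quantity $\omega$ given by

\[ \omega = 2\mu_B B/\hbar \]

where $\omega$ has units of 1/time or, equivalently, frequency. Then the wave function is

\[ |\psi(t)\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix} e^{-i(\omega/2)t} \\ e^{i(\omega/2)t} \end{pmatrix} \]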
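To understand the physical meaning of this wave function, we can evaluate it at a
variety of times. In particular, we have

\[ |\psi(t=0)\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ 1 \end{pmatrix} \]

which is just our initial condition: the spin is in the +x direction at $t = 0$. Further,

\[ |\psi(t=2\pi/\omega)\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix} e^{-i\pi} \\ e^{i\pi} \end{pmatrix} = -\frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ 1 \end{pmatrix} \]

This is the original wave function multiplied by a phase factor of −1. Thus, at
$t = 2\pi/\omega$, the spin is once again pointing in the +x direction. This suggests
that we investigate intermediate times. At the halfway point between $t = 0$ and
$t = 2\pi/\omega$, i.e., at $t = (1/2)(2\pi/\omega)$, we get

\[ |\psi(t=(1/2)(2\pi/\omega))\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix} e^{-i\pi/2} \\ e^{i\pi/2} \end{pmatrix} = -\frac{i}{\sqrt{2}}\begin{pmatrix} 1 \\ -1 \end{pmatrix} \]

which, up to an overall phase factor, is the wave function for spin in the −x direction.
Taking half of this interval again, we get, at $t = (1/4)(2\pi/\omega)$,

\[ |\psi(t=(1/4)(2\pi/\omega))\rangle = \frac{1}{\sqrt{2}}\begin{pmatrix} e^{-i\pi/4} \\ e^{i\pi/4} \end{pmatrix} = \frac{e^{-i\pi/4}}{\sqrt{2}}\begin{pmatrix} 1 \\ i \end{pmatrix} \]

which is the spin eigenstate in the +y direction. Similarly, taking $t = (3/4)(2\pi/\omega)$
gives the spin eigenstate in the −y direction.

Putting all of this information together, we see that the electron is precessing,
with the direction of its spin vector rotating in the counterclockwise direction
(Figure 8.9). The period of precession is $2\pi/\omega$, so the angular frequency is
$\omega = 2\mu_B B/\hbar$. This phenomenon is the basis of magnetic resonance imaging, which
will be examined in more detail in Chapter 14.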
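
The precession can also be followed numerically by computing the expectation values of the spin components from the time-dependent wave function for a spin that starts in the +x direction. The sketch below is a hedged illustration rather than part of the original text: the function names `spin_state` and `spin_direction` and the value `OMEGA = 1.0` are assumptions chosen for the example, and the expectation values are obtained from the standard Pauli-matrix expressions, in units of $\hbar/2$.

```python
import cmath
import math


def spin_state(t, omega):
    """Return (psi_plus, psi_minus) at time t for a spin that starts along +x."""
    half_angle = 0.5 * omega * t
    return (cmath.exp(-1j * half_angle) / math.sqrt(2),
            cmath.exp(1j * half_angle) / math.sqrt(2))


def spin_direction(state):
    """Return (<S_x>, <S_y>, <S_z>) in units of hbar/2 for a normalized state."""
    a, b = state
    cross = a.conjugate() * b
    # <sigma_x> = 2 Re(a* b), <sigma_y> = 2 Im(a* b), <sigma_z> = |a|^2 - |b|^2
    return (2 * cross.real, 2 * cross.imag, abs(a) ** 2 - abs(b) ** 2)


OMEGA = 1.0                        # illustrative value; only omega * t matters
PERIOD = 2 * math.pi / OMEGA

for fraction in (0.0, 0.25, 0.5, 0.75, 1.0):
    sx, sy, sz = spin_direction(spin_state(fraction * PERIOD, OMEGA))
    print(f"t = {fraction:4.2f} T: <S_x> = {sx:+.3f}, <S_y> = {sy:+.3f}, <S_z> = {sz:+.3f}")
```

In this sketch the spin direction passes through +x, +y, −x, and −y in turn while the z component stays at zero, which is the counterclockwise rotation in the x-y plane described above.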






8.7 ■ SPIN SYSTEMS WITH TWO PARTICLES 

In this section we consider systems of two particles with spin. We first examine the 
case in which there is no spin-dependent interaction between the two particles. We 
then consider what happens if there is such an interaction, so that the Hamiltonian 
depends on the spins. 

Noninteracting Spins 

Consider the general system of two particles, each with spin 1/2, in which there is
no spin-dependent interaction between the particles. The spin operators for particle
1 and particle 2 can be written as $\mathbf{S}_1$ and $\mathbf{S}_2$, respectively. As in the case for a single
particle, we will consider states which are eigenfunctions of $S_1^2$, $S_2^2$, $S_{1z}$, and $S_{2z}$.
We have been using the notation $|\uparrow\rangle$ and $|\downarrow\rangle$ to refer to eigenstates of the single
particle spin operator $S_z$. For the two-particle system, we write the eigenstate as
$|m_{1s}\ m_{2s}\rangle$. So, for example, $|\uparrow\downarrow\rangle$ represents the state with spin up for particle 1
and spin down for particle 2 (i.e., $m_{1s} = +1/2$ and $m_{2s} = -1/2$). Thus,

\[ S_{1z}|\uparrow\downarrow\rangle = +\frac{\hbar}{2}|\uparrow\downarrow\rangle \]

\[ S_{2z}|\uparrow\downarrow\rangle = -\frac{\hbar}{2}|\uparrow\downarrow\rangle \]

The eigenvalues of $S_1^2$ and $S_2^2$ do not need to be specified in the wave function
because these are fixed by the fact that the particles have spin 1/2. Thus,

\[ S_1^2|\uparrow\downarrow\rangle = \hbar^2\,\frac{1}{2}\left(\frac{1}{2}+1\right)|\uparrow\downarrow\rangle \]

\[ S_2^2|\uparrow\downarrow\rangle = \hbar^2\,\frac{1}{2}\left(\frac{1}{2}+1\right)|\uparrow\downarrow\rangle \]




So far, this is all straightforward and rather uninteresting; all we have done is to 
write down a wave function which combines the spin information for two particles 
together. 

Now, however, consider the total spin of the system. We can write a total spin
operator

\[ \mathbf{S} = \mathbf{S}_1 + \mathbf{S}_2 \]

so

\[ S_z = S_{1z} + S_{2z} \]

and

\[ S^2 = S_1^2 + S_2^2 + 2\,\mathbf{S}_1\cdot\mathbf{S}_2 \tag{8.29} \]

From our previous discussion about the addition of angular momentum, we know
that the quantum number for the z component of the total spin $m_s$ is

\[ m_s = m_{1s} + m_{2s} \]

while the quantum number for the total spin $s$ can have the values $s = 0$ or $s = 1$.
Thus, the two-particle state can also be expressed in terms of $s$ and $m_s$ instead of
$m_{1s}$ and $m_{2s}$. The wave function corresponding to a state of definite $s$ and $m_s$ can be
written as $|s\ m_s\rangle$. For example, for $s = 0$, the only possible value of $m_s$ is $m_s = 0$,
and the wave function is $|0\ 0\rangle$, the singlet state. If $s = 1$, then $m_s = -1$, 0, or 1,
with corresponding wave functions $|1\ {-1}\rangle$, $|1\ 0\rangle$, and $|1\ 1\rangle$, the triplet state.

This leads to an obvious question: can we simultaneously measure $s$, $m_s$, $m_{1s}$,
and $m_{2s}$? This would be possible only if all of the operators $S^2$, $S_z$, $S_{1z}$, and $S_{2z}$
commuted with each other. Unfortunately, this is not the case. The expression for
$S^2$ (Equation 8.29) contains the term $2\,\mathbf{S}_1\cdot\mathbf{S}_2$, which expands out as
$2(S_{1x}S_{2x} + S_{1y}S_{2y} + S_{1z}S_{2z})$, and, for example, $S_{1x}$ and $S_{1y}$ do not commute with $S_{1z}$,
and $S_{2x}$ and $S_{2y}$ do not commute with $S_{2z}$. More explicitly,

\[ \begin{aligned} [S^2, S_{1z}] &= [S_1^2 + S_2^2 + 2(S_{1x}S_{2x} + S_{1y}S_{2y} + S_{1z}S_{2z}),\ S_{1z}] \\ &= 2[S_{1x}, S_{1z}]S_{2x} + 2[S_{1y}, S_{1z}]S_{2y} \\ &= -2i\hbar S_{1y}S_{2x} + 2i\hbar S_{1x}S_{2y} \end{aligned} \]

which is nonzero. A similar argument indicates that $S^2$ does not commute with
$S_{2z}$. Therefore, it is not possible to measure $s$ and $m_{1s}$, $m_{2s}$ simultaneously. The
particles can be in a state in which the z component of both spins is known exactly
(written as $|m_{1s}\ m_{2s}\rangle$), or they can be in a state in which $s$ and $m_s$ are known exactly
(written as $|s\ m_s\rangle$), but not both at the same time.

Suppose that we are in a state of definite $s$ and $m_s$. Although we cannot uniquely
specify the values of $m_{1s}$ and $m_{2s}$, we can write the state $|s\ m_s\rangle$ as a linear
combination of the four possible $|m_{1s}\ m_{2s}\rangle$ states, since the latter form a basis set. In
other words, we can write

\[ |s\ m_s\rangle = c_1|\uparrow\uparrow\rangle + c_2|\uparrow\downarrow\rangle + c_3|\downarrow\uparrow\rangle + c_4|\downarrow\downarrow\rangle \tag{8.30} \]

and we can calculate the constants $c_1$, $c_2$, $c_3$, and $c_4$. One reason for doing this is that
if the particles are in a state of definite $s$, $m_s$, there is nothing to prevent us from
subsequently measuring the individual spin states. Although we cannot predict
the result in advance (since we are not in an eigenstate of $S_{1z}$ and $S_{2z}$), Equation
(8.30) can be used to find the probabilities of measuring a particular value of $m_{1s}$
and $m_{2s}$.

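Consider first the state $|1\ 1\rangle$. It must always be true that $m_{1s} + m_{2s} = m_s$, and
the only way to have $m_{1s} + m_{2s} = 1$ is for both $m_{1s}$ and $m_{2s}$ to be +1/2, i.e., for
both particles to be in the spin up state. Thus,

\[ |1\ 1\rangle = |\uparrow\uparrow\rangle \tag{8.31} \]

A similar argument gives

\[ |1\ {-1}\rangle = |\downarrow\downarrow\rangle \tag{8.32} \]

On the other hand, both $|1\ 0\rangle$ and $|0\ 0\rangle$ have $m_s = 0$, so they are both linear
combinations of $|\uparrow\downarrow\rangle$ and $|\downarrow\uparrow\rangle$. To find the desired linear combinations, recall
that the lowering operator $S_-$ acts on $|1\ 1\rangle$ to give a multiple of $|1\ 0\rangle$. Since $S_-$
is just $S_x - iS_y$, it is correct to write

\[ S_- = S_{1-} + S_{2-} \]

We can therefore begin with Equation (8.31), apply $S_-$ to the left-hand side, and
$S_{1-} + S_{2-}$ to the right-hand side (using Equation 8.19), and derive an expression
for $|1\ 0\rangle$ as a function of $|\uparrow\downarrow\rangle$ and $|\downarrow\uparrow\rangle$. This procedure gives

\[ \begin{aligned} S_-|1\ 1\rangle &= S_{1-}|\uparrow\uparrow\rangle + S_{2-}|\uparrow\uparrow\rangle \\ \hbar\sqrt{1(1+1) - 1(1-1)}\,|1\ 0\rangle &= \hbar\sqrt{(1/2)(1/2+1) - (1/2)(1/2-1)}\,|\downarrow\uparrow\rangle \\ &\quad + \hbar\sqrt{(1/2)(1/2+1) - (1/2)(1/2-1)}\,|\uparrow\downarrow\rangle \end{aligned} \]

which simplifies to

\[ |1\ 0\rangle = \frac{1}{\sqrt{2}}|\downarrow\uparrow\rangle + \frac{1}{\sqrt{2}}|\uparrow\downarrow\rangle \tag{8.33} \]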


This method cannot be applied to the state $|0\ 0\rangle$, but now note that all of our spin
states should be orthonormal. The normalized linear combination of $|\uparrow\downarrow\rangle$ and
$|\downarrow\uparrow\rangle$ which is orthogonal to $\frac{1}{\sqrt{2}}|\downarrow\uparrow\rangle + \frac{1}{\sqrt{2}}|\uparrow\downarrow\rangle$ is

\[ |0\ 0\rangle = \frac{1}{\sqrt{2}}|\downarrow\uparrow\rangle - \frac{1}{\sqrt{2}}|\uparrow\downarrow\rangle \tag{8.34} \]

(Of course, there is always the freedom to multiply the right-hand side of Equation
(8.34) by a factor with unit absolute value such as −1.)

We have now expressed all four $|s\ m_s\rangle$ states as linear combinations of the
$|m_{1s}\ m_{2s}\rangle$ states [Equations (8.31), (8.32), (8.33), and (8.34)]. Although we have
examined the specific case of two particles with spin 1/2, this result can be
generalized to particles with other spins. The constants which appear in Equations
(8.31), (8.32), (8.33), and (8.34) (and in those more general expressions) are called
Clebsch-Gordan coefficients.
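
The lowering-operator construction for two spin-1/2 particles can be reproduced in a few lines of code. The sketch below is an illustration under stated assumptions, not the text's own method of presentation: states are stored as dictionaries keyed by labels such as `"ud"` (particle 1 up, particle 2 down), the helper names `lower_total`, `normalize`, and `overlap` are hypothetical, and units with $\hbar = 1$ are used so that the single-particle lowering operator simply turns an up spin into a down spin.

```python
import math


def lower_total(state):
    """Apply S_- = S_1- + S_2- (in units of hbar) to a two-spin state.

    For spin 1/2, the single-particle lowering operator maps up to down
    and gives zero when acting on down.
    """
    result = {}
    for label, amp in state.items():
        for i in (0, 1):                       # which particle is lowered
            if label[i] == "u":
                new = label[:i] + "d" + label[i + 1:]
                result[new] = result.get(new, 0.0) + amp
    return result


def normalize(state):
    """Rescale a state so that the squared amplitudes sum to 1."""
    norm = math.sqrt(sum(abs(a) ** 2 for a in state.values()))
    return {label: a / norm for label, a in state.items()}


def overlap(a, b):
    """Inner product <a|b> for states with real amplitudes."""
    return sum(a.get(k, 0.0) * b.get(k, 0.0) for k in set(a) | set(b))


triplet_11 = {"uu": 1.0}                                   # |1 1>
triplet_10 = normalize(lower_total(triplet_11))            # |1 0>
triplet_1m1 = normalize(lower_total(triplet_10))           # |1 -1>
singlet_00 = {"du": 1 / math.sqrt(2), "ud": -1 / math.sqrt(2)}   # |0 0>

print("|1 0>  =", triplet_10)
print("|1 -1> =", triplet_1m1)
print("<1 0|0 0> =", overlap(triplet_10, singlet_00))
print("S_-|0 0> =", lower_total(singlet_00))
```

The amplitudes stored in these dictionaries are the Clebsch-Gordan coefficients for this case. Applying `lower_total` to the singlet gives zero amplitude, consistent with the statement that the lowering method cannot generate $|0\ 0\rangle$ from the triplet.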


Example 8.4. Probabilities for a Two-Particle System. 

A system of 2 particles, each with spin 1/2, is in the singlet state. A measurement
is made of the z component of the spin of the first particle. What is the probability
that $m_{1s} = +1/2$?

In the singlet state, the wave function is

\[ |0\ 0\rangle = \frac{1}{\sqrt{2}}|\downarrow\uparrow\rangle - \frac{1}{\sqrt{2}}|\uparrow\downarrow\rangle \tag{8.35} \]

The only state with $m_{1s} = +1/2$ appearing on the right-hand side of Equation
(8.35) is $|\uparrow\downarrow\rangle$, so the probability that particle 1 is in the spin up state is

\[ \begin{aligned} P &= |\langle\uparrow\downarrow|0\ 0\rangle|^2 \\ &= \left|\langle\uparrow\downarrow|\left(\frac{1}{\sqrt{2}}|\downarrow\uparrow\rangle - \frac{1}{\sqrt{2}}|\uparrow\downarrow\rangle\right)\right|^2 \\ &= 1/2 \end{aligned} \]

Thus, there is a 50% chance that the first particle is found to be in the spin up state
and a 50% chance it is found to be in the spin down state.


Interacting Spins 

Now consider two spin-1/2 particles with a spin-dependent interaction. One of the
simplest possible spin interactions is given by the Hamiltonian

\[ H = \lambda\,\mathbf{S}_1\cdot\mathbf{S}_2 \tag{8.36} \]

where $\lambda$ is a real constant. Here it is assumed that the particles are fixed in space,
and there is no other interaction between them, so that Equation (8.36) gives the
entire interaction. Since this Hamiltonian depends on all three components of $\mathbf{S}_1$
and $\mathbf{S}_2$, it will not commute with either $S_{1z}$ or with $S_{2z}$. Hence, states of definite
$m_{1s}$ and $m_{2s}$ are not, in general, eigenfunctions of $H$; $|\uparrow\downarrow\rangle$ and $|\downarrow\uparrow\rangle$, for example,
are not.

On the other hand, the Hamiltonian given in Equation (8.36) does commute
with $S^2$. This is because Equation (8.29) can be rewritten as

\[ \mathbf{S}_1\cdot\mathbf{S}_2 = \frac{1}{2}\left(S^2 - S_1^2 - S_2^2\right) \]

which allows the Hamiltonian in Equation (8.36) to be written as

\[ H = \frac{\lambda}{2}\left(S^2 - S_1^2 - S_2^2\right) \]

When written in this form, it is clear that $H$ commutes with $S^2$ and $S_z$. Thus, states
of definite $s$ and $m_s$ are also eigenstates of $H$, and we can compute their energies.
We obtain

\[ H|s\ m_s\rangle = \frac{\lambda}{2}\left[\hbar^2 s(s+1) - \hbar^2\,\frac{1}{2}\left(\frac{1}{2}+1\right) - \hbar^2\,\frac{1}{2}\left(\frac{1}{2}+1\right)\right]|s\ m_s\rangle \]

Thus, the energy $E$ is a function entirely of $s$ and is independent of $m_s$; all three
triplet states are degenerate. Inserting the actual values for $s$ gives the energy levels
of the system:

\[ s = 1 \ \text{(triplet state)} \qquad E = \frac{1}{4}\lambda\hbar^2 \]

\[ s = 0 \ \text{(singlet state)} \qquad E = -\frac{3}{4}\lambda\hbar^2 \]

The energy levels of more complex spin-dependent Hamiltonians can be calculated
using similar methods.
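
These energies can be verified by applying $\mathbf{S}_1\cdot\mathbf{S}_2$ directly to the two-particle states. The following sketch is a hedged illustration, not part of the original text: it assumes the standard identity $\mathbf{S}_1\cdot\mathbf{S}_2 = S_{1z}S_{2z} + \frac{1}{2}(S_{1+}S_{2-} + S_{1-}S_{2+})$, works in units of $\hbar^2$, and uses hypothetical names (`s1_dot_s2`, `eigenvalue`, `STATES`) with states stored as dictionaries keyed by labels such as `"ud"`.

```python
import math


def s1_dot_s2(state):
    """Apply S_1 . S_2 (in units of hbar^2) to a two-spin-1/2 state."""
    flip = {"ud": "du", "du": "ud"}
    result = {}
    for label, amp in state.items():
        parallel = label[0] == label[1]
        # S_1z S_2z contributes +1/4 for parallel spins and -1/4 for antiparallel spins.
        result[label] = result.get(label, 0.0) + (0.25 if parallel else -0.25) * amp
        # The raising/lowering terms exchange the two spins when they are antiparallel.
        if not parallel:
            other = flip[label]
            result[other] = result.get(other, 0.0) + 0.5 * amp
    return result


def eigenvalue(state):
    """Return E / (lambda hbar^2) if state is an eigenstate; raise ValueError otherwise."""
    out = s1_dot_s2(state)
    key = max(state, key=lambda k: abs(state[k]))
    ratio = out[key] / state[key]
    for k in set(state) | set(out):
        if abs(out.get(k, 0.0) - ratio * state.get(k, 0.0)) > 1e-12:
            raise ValueError("not an eigenstate of S_1 . S_2")
    return ratio


R = 1 / math.sqrt(2)
STATES = {
    "|1 1>": {"uu": 1.0},
    "|1 0>": {"du": R, "ud": R},
    "|1 -1>": {"dd": 1.0},
    "|0 0>": {"du": R, "ud": -R},
}

energies = {name: eigenvalue(state) for name, state in STATES.items()}
for name, energy in energies.items():
    print(f"{name}: E = {energy:+.2f} lambda hbar^2")
```

Calling `eigenvalue({"ud": 1.0})` raises `ValueError` in this sketch, reflecting the fact that a state with antiparallel spins and definite $m_{1s}$, $m_{2s}$ is not an eigenstate of this Hamiltonian.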


Example 8.5. The Magnetic Dipole-Dipole Interaction Between Two 
Particles. 

The magnetic dipole-dipole interaction between two particles with magnetic moments
$\boldsymbol{\mu}_1$ and $\boldsymbol{\mu}_2$ fixed in space at a separation $\mathbf{r}$ is given by the Hamiltonian







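The choice of the coordinate system is arbitrary, so we choose the vector $\mathbf{r}$ separating
the two neutrons to lie along the z-axis. Then $\mathbf{r} = a\hat{\mathbf{z}}$ and $r = a$, so the
Hamiltonian becomes

\[ H = \left(\frac{g_n e}{2m_n}\right)^2\frac{1}{a^3}\left(\mathbf{S}_1\cdot\mathbf{S}_2 - 3S_{1z}S_{2z}\right) \]

As above, $\mathbf{S}_1\cdot\mathbf{S}_2$ can be written as $\mathbf{S}_1\cdot\mathbf{S}_2 = \frac{1}{2}(S^2 - S_1^2 - S_2^2)$. It is also true that
$S_z^2 = (S_{1z} + S_{2z})^2 = S_{1z}^2 + S_{2z}^2 + 2S_{1z}S_{2z}$, so

\[ S_{1z}S_{2z} = \frac{1}{2}\left(S_z^2 - S_{1z}^2 - S_{2z}^2\right) \]

and the Hamiltonian becomes

\[ H = \left(\frac{g_n e}{2m_n}\right)^2\frac{1}{a^3}\left[\frac{1}{2}\left(S^2 - S_1^2 - S_2^2\right) - \frac{3}{2}\left(S_z^2 - S_{1z}^2 - S_{2z}^2\right)\right] \tag{8.37} \]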


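If the neutrons are in a state $|s\ m_s\rangle$, this state is an eigenfunction of $H$. The only
possible confusion arises from the operators $S_{1z}^2$ and $S_{2z}^2$ in Equation (8.37), since
the state $|s\ m_s\rangle$ is not a state of definite $m_{1s}$ and $m_{2s}$. However, note that for a
spin-1/2 particle, $S_z^2|\uparrow\rangle = (\hbar/2)^2|\uparrow\rangle$ and $S_z^2|\downarrow\rangle = (\hbar/2)^2|\downarrow\rangle$, so $S_z^2$ applied to
any spin state gives an eigenvalue of $\hbar^2/4$.

Then applying $H$ to the state $|s\ m_s\rangle$ yields

\[ \begin{aligned} H|s\ m_s\rangle &= \left(\frac{g_n e}{2m_n}\right)^2\frac{1}{a^3}\left[\frac{1}{2}\left(S^2 - S_1^2 - S_2^2\right) - \frac{3}{2}\left(S_z^2 - S_{1z}^2 - S_{2z}^2\right)\right]|s\ m_s\rangle \\ &= \frac{g_n^2 e^2}{4m_n^2 a^3}\left[\frac{\hbar^2}{2}\left(s(s+1) - \frac{3}{4} - \frac{3}{4}\right) - \frac{3\hbar^2}{2}\left(m_s^2 - \frac{1}{4} - \frac{1}{4}\right)\right]|s\ m_s\rangle \\ &= \frac{g_n^2 e^2\hbar^2}{8m_n^2 a^3}\left[s(s+1) - 3m_s^2\right]|s\ m_s\rangle \end{aligned} \]

so the energy levels are

\[ E = \frac{g_n^2 e^2\hbar^2}{8m_n^2 a^3}\left[s(s+1) - 3m_s^2\right] \]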


n 


The dipole-dipole interaction splits all of the $s$, $m_s$ states into distinct energy levels
with the exception of the states $m_s = \pm 1$, which remain degenerate. These energy
levels are

\[ s = 1,\ m_s = \pm 1, \qquad E = \frac{g_n^2 e^2\hbar^2}{8m_n^2 a^3}(-1) \]

\[ s = 0,\ m_s = 0, \qquad E = 0 \]

\[ s = 1,\ m_s = 0, \qquad E = \frac{g_n^2 e^2\hbar^2}{8m_n^2 a^3}(2) \]

The dipole-dipole interaction between the proton and electron is the basis for
hyperfine splitting in hydrogen, which is discussed in more detail in Chapter 9.
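
The level structure of Example 8.5 can be tabulated with a short calculation. The sketch below is an illustration under stated assumptions: it takes the energies to be proportional to $s(s+1) - 3m_s^2$, measures them in units of the common prefactor (called `E0` here, a label introduced only for this example), and uses the hypothetical function name `dipole_level`.

```python
def dipole_level(s, m_s):
    """Return E / E0 = s(s + 1) - 3 m_s^2 for the state |s m_s>."""
    return s * (s + 1) - 3 * m_s ** 2


# Group the four |s m_s> states by their energy in units of E0.
levels = {}
for s, m_s in [(1, 1), (1, 0), (1, -1), (0, 0)]:
    levels.setdefault(dipole_level(s, m_s), []).append((s, m_s))

for energy in sorted(levels):
    print(f"E/E0 = {energy:+d}: states (s, m_s) = {levels[energy]}")
```

Grouping the states this way makes the remaining degeneracy visible: only the two states with $m_s = \pm 1$ share an energy.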


8.8 ■ MEASUREMENT THEORY 

Consider a particle in a one-dimensional infinite square well of width $a$ centered
at the origin. If the particle is in the ground state, then the time-dependent wave
function can be written as

\[ \Psi(x,t) = \sqrt{\frac{2}{a}}\cos\left(\frac{\pi x}{a}\right)e^{-iEt/\hbar} \]

Rewriting the spatial part of the wave function as a sum of complex exponentials,
the full wave function is

\[ \Psi(x,t) = \frac{1}{\sqrt{2a}}\,e^{i(\pi x/a - Et/\hbar)} + \frac{1}{\sqrt{2a}}\,e^{-i(\pi x/a + Et/\hbar)} \tag{8.38} \]

This wave function looks like the superposition of two waves moving in opposite
directions. However, the particle, when observed, can obviously only be moving in
a single direction. Suppose that a measurement is made of the direction of motion
of the particle, and assume, for instance, that the particle is found to be moving to
the right. Then the wave function becomes

\[ \Psi(x,t) = e^{i(\pi x/a - Et/\hbar)} \tag{8.39} \]


Thus, the act of making a measurement alters the wave function, collapsing it from
the form given by Equation (8.38) to the form given by Equation (8.39); the wave
function changes from a superposition of states into a single state.

The idea that the particle is in an indeterminate superposition of states, and
is only forced into a single definite state by the act of measurement, is called the
“Copenhagen interpretation,” and it is the most widely held view of the nature
of measurement in quantum mechanics. However, this interpretation has some
disturbing consequences. Consider, for example, a set of two particles, each with
spin 1/2, that is in the singlet state (e.g., as in Example 8.4). If the z component
of the spin of the first particle is measured and found to be +1/2, then the second
particle must have z component of spin equal to −1/2, since $m_{1s} + m_{2s} =
m_s = 0$. Thus, the final wave function in this case is $|\uparrow\downarrow\rangle$. The act of measuring
the individual spins of the particles alters the wave function from
$\frac{1}{\sqrt{2}}|\downarrow\uparrow\rangle - \frac{1}{\sqrt{2}}|\uparrow\downarrow\rangle$ to $|\uparrow\downarrow\rangle$. This is called the “collapse of the wave
function.” Before the measurement is made, the particles have the potential to be in
either the $|\uparrow\downarrow\rangle$ state or the $|\downarrow\uparrow\rangle$ state, but they are not actually in either state.
The act of measuring the spin of one particle collapses the wave function into one
of these two states.

Now consider the following thought experiment. Two electrons are prepared in 
a singlet state. Without either spin being measured, one electron is left on earth 
while the second is transported to Alpha Centauri. A scientist on earth measures 
the spin of the first electron and finds it to be in the spin up state. Immediately 
following this measurement, a scientist orbiting Alpha Centauri measures the spin 
of the second electron and, of course, finds it to be in the spin down state. 

This may not seem particularly remarkable. For example, if we began with a
white marble and a black marble, leaving one on earth and transporting the second
one to a nearby star, the discovery that the marble left on earth was white would
immediately indicate that the other marble must be black. The quantum mechanical
situation is more subtle, however. Strictly speaking, the two electron spins remain
in a linear superposition of the $|\uparrow\downarrow\rangle$ and $|\downarrow\uparrow\rangle$ states until one of the spins
is measured. The scientist on earth collapses the wave function by measuring the
spin of the first electron, and somehow the second electron, 4 light-years away,
immediately “knows” to assume the opposite spin state! This rather strange result
is a version of the Einstein-Podolsky-Rosen paradox.

As an even more extreme case, consider the example of “Schrödinger’s cat.”
A cat is inside a closed box (Figure 8.10). A radioactive substance is decaying; if
a decay occurs in a fixed time interval, poison gas will be released into the box,
killing the cat. On the other hand, if no decay occurs in the given time interval, the
cat will remain alive. What does the Copenhagen interpretation say about the state
of the cat at any given time? Before we open up the box to check on the cat, it can
be treated as the superposition of two states, $|\psi_{\text{alive}}\rangle$ and $|\psi_{\text{dead}}\rangle$, and we can write

\[ |\psi_{\text{cat}}\rangle = \frac{1}{\sqrt{2}}|\psi_{\text{alive}}\rangle + \frac{1}{\sqrt{2}}|\psi_{\text{dead}}\rangle \]

Now we open up the box, and our act of observing collapses the wave function. If
the cat is dead, for example, we find

\[ |\psi_{\text{cat}}\rangle = |\psi_{\text{dead}}\rangle \]

The state of the cat prior to our observation, however, seems absurd. How can the
cat be in a linear superposition of “alive” and “dead”? This seems like a very
unsatisfying state of affairs. In fact, the entire Copenhagen interpretation represents a
rather substantial shift in the way physicists had viewed the nature of measurement,
so other interpretations have been put forward.






FIGURE 8.10 The unfortunate Schrödinger’s cat.


Hidden Variables 

An alternative way of framing quantum mechanics is to assume that it only seems
like a probabilistic theory because it represents an approximation to a deeper,
deterministic theory. This point of view is perhaps the most conservative, since
it assumes that the more bizarre aspects of quantum mechanics are due only to
our ignorance of the true underlying theory. There are certainly precedents for this
idea in other areas of physics. Suppose, for instance, that several measurements
are made of the speed $v$ of molecules in a sample of gas. This distribution of
speeds would appear to be random, and a large enough set of measurements would
uncover a probability distribution $P(v)$ for the molecular speeds given by

\[ P(v) = 4\pi\left(\frac{m}{2\pi kT}\right)^{3/2} v^2 e^{-mv^2/2kT} \]

called the Maxwell distribution, where $m$ is the molecular mass, $k$ is Boltzmann's
constant, and $T$ is the temperature. One might be tempted to conclude that the motion
of the gas molecules is completely random, and the only thing measurable is the
probability distribution.

In fact, of course, this is not the case. The molecules follow trajectories that are
completely determined by classical mechanics. The motion only seems random
because of the enormous number of molecules involved. This is the fundamental
idea of statistical mechanics: there exist deterministic systems which are so large
that an exact calculation of all of the trajectories is impractical; the best that can
be done is to describe the system in terms of probabilities.

In the same way, we can postulate that quantum mechanics is actually deterministic,
and only our ignorance makes it appear random. This is the hidden variables
formulation of quantum mechanics. Unfortunately for those dissatisfied with the
Copenhagen interpretation, hidden variables models face serious difficulties. In
particular, J.S. Bell showed in 1964 that hidden variables models can be tested
experimentally. By considering experiments of the sort discussed in the previous
section, in which particles with a known total angular momentum are allowed to
fly apart and the individual spins are then measured, Bell showed that the Copenhagen
interpretation and the hidden variables interpretation give different results.
Experiments based on Bell’s idea have subsequently shown that it is the hidden
variables model, not the Copenhagen interpretation, which is incompatible with
the observed behavior of such particles.

There is one loophole here: Bell’s theoretical work, and the subsequent experimental
investigations, apply only to so-called local hidden variables models.
These are models in which information propagates at a finite speed, e.g., a speed
at or below the speed of light. If information can propagate instantaneously from
one point to another, then hidden variables theories can still be made compatible
with experiment. However, this possibility seems even more outlandish than
the bizarreness of the Copenhagen interpretation, so it is normally not considered
seriously.

The Many Worlds Interpretation of Quantum Mechanics 

In 1957 Hugh Everett proposed another way to avoid the problem of the collapse
of the wave function: suppose that it never really collapses! Consider, for example,
a particle in the one-dimensional potential well discussed in the previous section.
Suppose that when you measure the direction of the particle, the wave function
doesn’t collapse; instead, the act of measurement splits the universe into two separate
worlds: in one world the particle is observed to be moving to the left, and in the
other world it is moving to the right. This is called the many worlds interpretation
of quantum mechanics. In this interpretation, the wave function never collapses.
Instead, the universe is constantly branching into multiple worlds; any possible
outcome of a quantum measurement happens in one of the resulting worlds. So,
for example, Schrödinger’s cat is alive in one world and dead in another (Figure
8.11). As the quantum mechanics instructor of this author once said, “Strangely
enough, there are grown men who believe this” (P.J.E. Peebles, Princeton University,
1979).



FIGURE 8.11 In the many-worlds interpretation, every measurement causes the universe
to branch into multiple worlds; each possible outcome of the measurement occurs in one
of the worlds.


EXERCISES 

8.1 (a) A particle with spin 1 has orbital angular momentum $l = 0$. What are the possible
values for the total angular momentum quantum number $j$?

(b) The same particle has $l = 3$. What are the possible values for $j$?

8.2 (a) A particle has spin 3/2 and orbital angular momentum $l = 1$. What are the possible
values for the total angular momentum quantum number $j$?

(b) For each value of $j$ in part (a), determine the possible values of $m_j$.

8.3 Determine (using the matrix representation) which of the following operators are
Hermitian and which are not: $S_x$, $S_y$, $S_z$, $S_+$, $S_-$.





8.4 Derive the eigenvalues and the corresponding normalized eigenvectors of $S_y$ given
in Equations (8.24) and (8.25).

8.5 A particle has spin 1, so that $m_s = -1$, 0, or 1. Derive the matrices which correspond
to $S_x$, $S_y$, and $S_z$.

8.6 (a) A particle has $s = 3/2$. The operator $S_{++}$ is defined to be the square of the raising
operator: $S_{++} = (S_+)^2$, where $S_+$ is the usual raising operator:

\[ S_+|s\ m_s\rangle = \hbar\sqrt{s(s+1) - m_s(m_s+1)}\,|s\ m_s+1\rangle \]

Derive the matrix corresponding to the operator $S_{++}$.

(b) What is the matrix corresponding to the adjoint operator $(S_{++})^\dagger$?

8.7 Let the operator $Q$ be given by $Q = S_+S_-$, where $S_+$ and $S_-$ are the usual raising and
lowering operators:

\[ S_-|s\ m_s\rangle = \hbar\sqrt{s(s+1) - m_s(m_s-1)}\,|s\ m_s-1\rangle \]

\[ S_+|s\ m_s\rangle = \hbar\sqrt{s(s+1) - m_s(m_s+1)}\,|s\ m_s+1\rangle \]

Derive the matrix corresponding to the operator $Q$ for a spin 1 particle. Determine
whether or not $Q$ is Hermitian.

8.8 Using the matrix representation of the spin operators for $s = 1/2$, verify the results
for $[S_x, S_y]$, $[S_y, S_z]$, and $[S_z, S_x]$ given in Equations (8.1)-(8.3).

8.9 A large number of spin-1/2 particles are run through a Stern-Gerlach machine. When
they emerge, all of the particles have the same two-component spin wave function (in the usual
basis in which $\begin{pmatrix} 1 \\ 0 \end{pmatrix}$ represents spin in the +z direction, and $\begin{pmatrix} 0 \\ 1 \end{pmatrix}$ represents spin in the

—z direction). The spin of these particles is measured in the direction. On average, 

9/25 of the particles have spin in the Tc direction, and 16/25 have spin in the --c 
direction. 

(a) Determine a possible normalized spin wave function.

(b) Is there a single unique solution to part (a), a finite number of different solutions,
or an infinite number of different solutions? (Multiplying the entire wave vector
by a constant does not count as a different solution.)

8.10 A Stern-Gerlach experiment is set up with the axis of the inhomogeneous magnetic
field in the x-y plane, at an angle $\theta$ relative to the x-axis. Call this direction $r$:

\[ S_r = (\cos\theta)S_x + (\sin\theta)S_y \]






(a) For a spin-1/2 particle, calculate the matrix corresponding to $S_r$. Calculate the
eigenvalues and corresponding eigenvectors. Normalize the eigenvectors and
express them in the form $a|\uparrow\rangle + b|\downarrow\rangle$, where $a$ and $b$ are constants.

(b) Suppose a measurement of the spin of the particle in the $r$ direction is made and it
is determined that the spin is in the positive $r$ direction, i.e., $S_r|\psi\rangle = (+\hbar/2)|\psi\rangle$.
Now a second measurement is made to determine $m_{sx}$ (the component of the
spin in the x direction). What is the probability that $m_{sx} = -1/2$? Suppose that
instead of measuring $m_{sx}$, the z component of the spin $m_s$ is measured. What is
the probability that $m_s = +1/2$?

(c) Suppose that the particle has spin in the positive $r$ direction as in part (b). The z
component of the spin is measured and it is discovered that $m_s = +1/2$. Now a
third measurement is made to determine $m_{sx}$. What is the probability that
$m_{sx} = -1/2$?

8-11 A spin-1/2 particle is in the state \ f) = </T/lS\ f) + i vT/3 j j). 

(a) Verify that the wave function is correctly normalized. 

(b) A measurement is made of the x component of the spin. What is the probability 
that the spin will be in the —x direction? 

(c) Suppose a measurement is made of the spin in the $z$ direction and it is discovered
that the particle has $m_s = -1/2$. Now a second measurement is made to determine
the spin in the $x$ direction. What is the probability that the spin will be in the $+x$
direction?

8.12 An electron is precessing in a magnetic field. The wave function for the electron is

$\frac{1}{\sqrt{2}} \begin{pmatrix} \cos\omega t + \sin\omega t \\ \cos\omega t - \sin\omega t \end{pmatrix}$


(a) Describe the plane of rotation of the spin of this particle. 

(b) In what direction is it rotating in this plane? 

8.13 A magnetic field pointing in the $-z$ direction produces a Hamiltonian $H = -\omega S_z$,
where $\omega$ is a constant with units of frequency. A spin-1/2 particle is placed in this
magnetic field. At $t = 0$, the particle is pointing in the $+y$ direction.

(a) Derive an expression for the spin vector $\begin{pmatrix} c_1 \\ c_2 \end{pmatrix}$ as a function of time.

(b) At $t = \pi/\omega$, a measurement is made of the spin in the $x$ direction. What is the
probability that the spin is in the $+x$ direction?

(c) Suppose that at $t = \pi/\omega$, a measurement is made of the spin in the $x$ direction, and
it is found that the spin is in the $+x$ direction. Then at the time $t = 2\pi/\omega$, another
measurement is made of the spin in the $x$ direction. What is the probability that
the spin is in the $+x$ direction?

8.14 An electron is precessing in a magnetic field aligned along the $+z$-axis. At $t = 0$, the
spin of the electron is in the positive $x$ direction. The wave function is




For $t > 0$, calculate the probability of finding the electron in the state

(a) $m_s = +\hbar/2$

(b) $m_{sx} = +\hbar/2$, where $m_{sx}$ is the component of spin in the $x$ direction.



Exercises 




8.15 A spin-1/2 particle is placed in a magnetic field pointing in the $+x$ direction which
produces a Hamiltonian $H = \omega S_x$, where $\omega$ is a constant with units of frequency. At
$t = 0$, the particle is pointing in the direction. Derive an expression for the spin
vector $\begin{pmatrix} c_1 \\ c_2 \end{pmatrix}$ as a function of time.

8.16 A magnetic field pointing in the $+z$ direction produces a Hamiltonian $H = \omega S_z$, where
$\omega$ is a constant with units of frequency. A spin-1 particle is placed in this magnetic
field. The matrix corresponding to $S_z$ for a spin-1 particle is

$S_z = \hbar \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & -1 \end{pmatrix}$
At $t = 0$, the particle is pointing in the $+x$ direction with normalized spin vector

$\begin{pmatrix} 1/2 \\ 1/\sqrt{2} \\ 1/2 \end{pmatrix}$

Derive an expression for the spin vector $\begin{pmatrix} c_1 \\ c_2 \\ c_3 \end{pmatrix}$ as a function of time.

8.17 Consider a system of two particles: particle 1 has spin 1, and particle 2 has spin
1/2. Let $\mathbf{S}$ be the total angular momentum operator for the two particles, where the
eigenvalues of $S^2$ and $S_z$ are $\hbar^2 s(s + 1)$ and $\hbar m_s$, respectively. The particles are in
the state $s = 3/2$ and $m_s = 1/2$.

(a) Calculate the wave function $|s = 3/2\ \ m_s = 1/2\rangle$ as a linear combination of the
wave functions $|m_{1s}\ m_{2s}\rangle$, where $m_{1s}$ is the $z$ component of the spin of particle
1, and $m_{2s}$ is the $z$ component of the spin of particle 2.

(b) Find the probabilities that the $z$ component of the spin of particle 1 is

i. $m_{1s} = +1$

ii. $m_{1s} = 0$

iii. $m_{1s} = -1$

8.18 Suppose that particle 1 (with spin 1) and particle 2 (with spin 1/2) interact via the
Hamiltonian operator

$H = \lambda\, \mathbf{S}_1 \cdot \mathbf{S}_2$

where $\lambda$ is a constant. Calculate the energy of the state $|s\ m_s\rangle$.

8.19 Two spin-1/2 particles are fixed in space with the Hamiltonian

$H = aS_z + bS^2$

where $a$ and $b$ are positive constants, and as usual, $S^2$ is the total spin operator squared
and $S_z$ is the operator which gives the $z$ component of the total spin. What are the
energy levels of this system?




CHAPTER

9


Time-Independent Perturbation
Theory



In theory, the Schrödinger equation allows us to solve any quantum mechanical
system exactly. We simply insert the potential $V$ and solve for the wave function $\psi$
and the energy $E$. Unfortunately, there are very few potentials, such as the infinite
square well or the Coulomb potential of the hydrogen atom, for which a simple
exact solution exists. In order to make any further progress, we need to develop
some techniques for finding approximate solutions to the Schrödinger equation.
This chapter and Chapter 11 are devoted to a very important set of these techniques
called perturbation theory.

The basic idea of perturbation theory rests on a simple general argument. Suppose
we begin with a potential for which we can solve the Schrödinger equation
exactly, such as the infinite square well of width $a$: recall from Chapter 4 that the
ground-state energy and wave function are given by $\psi = \sqrt{2/a}\,\sin(\pi x/a)$ and
$E = \hbar^2\pi^2/2ma^2$. Now suppose we make a tiny change in $V$ such as a small notch
in the center of the potential (Figure 9.1). We cannot solve the Schrödinger equation
for this new potential, but intuition suggests that a small change in $V$ ought
to produce a small change in $\psi$ and in $E$. This intuition is correct. The reason is
that the Schrödinger equation is a special kind of differential equation: it is linear,
i.e., $\psi$ and its derivatives are taken only to the first power. Linear differential equations
like the Schrödinger equation have the property that small changes in the
parameters produce small changes in the solution. The fact that a small change in
$V$ produces a small change in $\psi$ and $E$ is the fundamental idea of perturbation
theory.


9.1 ■ DERIVATION OF TIME-INDEPENDENT PERTURBATION THEORY 

Now we will calculate mathematically the change in $E$ produced by an arbitrary
small change in the Hamiltonian $H$. [Although we talk generically about a change
in the Hamiltonian, this usually amounts to a change in the potential.] Assume we
have a Hamiltonian $H$ for which we can solve the Schrödinger equation exactly.
We need to consider two possibilities for a small change in $H$: either the change
in $H$ is constant in time, or it varies as a function of time. If the change in $H$ is
constant in time, we are dealing with time-independent perturbation theory, which
is the subject of this chapter. If, on the other hand, the change in $H$ varies with time,
we have time-dependent perturbation theory, which is discussed in Chapter 11.






FIGURE 9.1 The infinite square well on the left has the ground-state wave function and
energy $\psi = \sqrt{2/a}\,\sin(\pi x/a)$ and $E = \hbar^2\pi^2/2ma^2$. A small change in the potential, shown
on the right, produces a small change in $\psi$ and $E$.


We will now derive what happens if we have a small, time-independent change
in the Hamiltonian. Assume that we have a Hamiltonian $H_0$, for which we can find
all of the eigenstates $|\psi_n\rangle$ with energies $E_n$:

$H_0|\psi_n\rangle = E_n|\psi_n\rangle$

Note that this is just shorthand for an infinite set of equations: $H_0|\psi_1\rangle = E_1|\psi_1\rangle$,
$H_0|\psi_2\rangle = E_2|\psi_2\rangle$, and so on. For example, the $|\psi_n\rangle$ could correspond to the hydrogen
wave functions, the spin eigenfunctions of an electron in a magnetic field,
or any other set of wave functions that are exact solutions of the Schrödinger
equation. Now add a small perturbation $\lambda H'$ to the Hamiltonian:

$H = H_0 + \lambda H'$ (9.1)

Here $\lambda$ is taken to be a dimensionless small number, $\lambda \ll 1$, so that the perturbation
$\lambda H'$ is small compared to the original Hamiltonian $H_0$. We would like to solve the
new Schrödinger equation:

$H|\psi\rangle = E|\psi\rangle$ (9.2)

In this equation $H$ is the new (perturbed) Hamiltonian of Equation (9.1), $|\psi\rangle$ is
the new wave function after we have added the perturbation to the Hamiltonian,
and $E$ is the new energy. Of course, we cannot solve this equation exactly (or
we wouldn’t be bothering to develop perturbation theory), but we can use some
mathematical techniques to see how the change in the Hamiltonian changes the
wave functions and energies.

The first step is to write the new energy and wave function in Equation (9.2) as
a power series in the small number $\lambda$ that appears in Equation (9.1):

$E = E_n + \lambda E^{[1]} + \lambda^2 E^{[2]} + \cdots$ (9.3)

$|\psi\rangle = |\psi_n\rangle + \lambda|\phi_1\rangle + \lambda^2|\phi_2\rangle + \cdots$ (9.4)

In these equations, $|\psi_n\rangle$ and $E_n$ are the original eigenfunction and energy before
we apply the perturbation; since Equation (9.2) has an infinite number of solutions
(corresponding to a small perturbation applied to any of the eigenfunctions
of $H_0$) we have to pick a particular eigenfunction to perturb. The energies
$E^{[1]}, E^{[2]}, \ldots$ and the wave functions $|\phi_1\rangle, |\phi_2\rangle, \ldots$ are unknown: our goal is to solve
for them.

Recall that we are interested in the small change in $E$ which results from our
small change $\lambda H'$ in the Hamiltonian. The first term in Equation (9.3) gives us
the energy before we apply the small perturbation. The rest of the terms give us
the small change in the energy due to the small change in $H$. But if $\lambda$ is tiny,
only the first of these terms really matters. For instance, if we take $\lambda = 10^{-6}$, then
$\lambda^2 = 10^{-12}$, $\lambda^3 = 10^{-18}$, and so on. So the third term in Equation (9.3) is tiny
compared to the second term, and the fourth term is tiny compared to the third
term. That means the change in $E$ is essentially $\lambda E^{[1]}$, and we can ignore all of the
other terms in Equation (9.3). [Exception: if, for a particular perturbation, $\lambda E^{[1]}$ is
exactly zero, we will have to go further and solve for $\lambda^2 E^{[2]}$.]

Substituting the expressions for $E$ and $|\psi\rangle$ from Equations (9.3) and (9.4) into
the Schrödinger equation given by Equation (9.2) gives the following rather messy
result:

$(H_0 + \lambda H')(|\psi_n\rangle + \lambda|\phi_1\rangle + \lambda^2|\phi_2\rangle + \cdots)$

$= (E_n + \lambda E^{[1]} + \lambda^2 E^{[2]} + \cdots)(|\psi_n\rangle + \lambda|\phi_1\rangle + \lambda^2|\phi_2\rangle + \cdots)$ (9.5)

Although it looks like this is only making matters worse, we can now apply two
ideas to simplify this equation. First, recall that if $\lambda$ is small, then $\lambda$ is much larger
than $\lambda^2$, which is much larger than $\lambda^3$, and so on. This means that for very small $\lambda$,
the terms in Equation (9.5) with different powers of $\lambda$ do not affect each other, so
Equation (9.5) must be satisfied separately for each individual power of $\lambda$. Equating
powers of $\lambda$ in this equation gives

$\lambda^0$: $H_0|\psi_n\rangle = E_n|\psi_n\rangle$ (9.6)

$\lambda^1$: $\lambda H'|\psi_n\rangle + \lambda H_0|\phi_1\rangle = \lambda E_n|\phi_1\rangle + \lambda E^{[1]}|\psi_n\rangle$ (9.7)

$\lambda^2$: $\lambda^2 H_0|\phi_2\rangle + \lambda^2 H'|\phi_1\rangle = \lambda^2 E^{[1]}|\phi_1\rangle + \lambda^2 E^{[2]}|\psi_n\rangle + \lambda^2 E_n|\phi_2\rangle$ (9.8)

Equation (9.6) is just the original unperturbed Schrödinger equation, which is
reassuring, but we cannot make further progress with Equations (9.7) and (9.8)
unless we know what happens when $H_0$ operates on $|\phi_1\rangle$ and $|\phi_2\rangle$. However, we
don’t even know what $|\phi_1\rangle$ and $|\phi_2\rangle$ are! Nonetheless, we can solve this problem
by applying a second idea. Recall that for any Hamiltonian $H_0$, we can find a set
of eigenfunctions $|\psi_m\rangle$ which form an orthonormal basis, i.e., any wave function
$|\psi\rangle$ can be written as

$|\psi\rangle = c_1|\psi_1\rangle + c_2|\psi_2\rangle + \cdots + c_m|\psi_m\rangle + \cdots$

So even though we don’t know $|\phi_1\rangle$ and $|\phi_2\rangle$ in Equations (9.7) and (9.8), we can
expand them out as linear combinations of eigenfunctions of $H_0$, and we do know
what happens when we apply $H_0$ to these eigenfunctions. We write

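$|\phi_1\rangle = c_1|\psi_1\rangle + c_2|\psi_2\rangle + \cdots = \sum_i c_i|\psi_i\rangle$ (9.9)

and

$|\phi_2\rangle = \sum_i d_i|\psi_i\rangle$ (9.10)

Then when $H_0$ is applied to the sums in Equations (9.9) and (9.10), it simply pulls
out the appropriate energy in front of each term:

$H_0|\phi_1\rangle = H_0\sum_i c_i|\psi_i\rangle = \sum_i E_i c_i|\psi_i\rangle$ (9.11)

and

$H_0|\phi_2\rangle = \sum_i E_i d_i|\psi_i\rangle$ (9.12)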

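When $H_0$ is applied to $|\phi_1\rangle$ in Equation (9.7), it produces the sum given in Equation
(9.11). Then we get

$\lambda H'|\psi_n\rangle + \lambda\sum_i E_i c_i|\psi_i\rangle = \lambda E_n\sum_i c_i|\psi_i\rangle + \lambda E^{[1]}|\psi_n\rangle$ (9.13)

We would like to solve this equation to find $E^{[1]}$. We take the inner product of $\langle\psi_n|$
with both sides of the equation, recalling that $\langle\psi_n|\psi_n\rangle = 1$:

$\lambda\langle\psi_n|H'|\psi_n\rangle + \lambda\sum_i E_i c_i\langle\psi_n|\psi_i\rangle = \lambda E_n\sum_i c_i\langle\psi_n|\psi_i\rangle + \lambda E^{[1]}$ (9.14)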

Note that $|\psi_n\rangle$ is the original eigenfunction which satisfies the unperturbed
Schrödinger equation, while the $|\psi_i\rangle$’s are the complete set of such eigenfunctions,
including $|\psi_n\rangle$ as a particular case. Hence, when we take the inner product
of $\langle\psi_n|$ with $|\psi_i\rangle$, we get zero for $i \neq n$ and one for $i = n$. This selects out $i = n$
from the sum in Equation (9.14), giving

$\lambda\langle\psi_n|H'|\psi_n\rangle + \lambda E_n c_n = \lambda E_n c_n + \lambda E^{[1]}$

so

$\lambda E^{[1]} = \langle\psi_n|\lambda H'|\psi_n\rangle$ (9.15)

where $\lambda E^{[1]}$ is the dominant or lowest order (i.e., lowest power of $\lambda$) change in the
energy due to the small change $\lambda H'$ in the Hamiltonian.
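As a numerical illustration of Equation (9.15), the following Python sketch estimates the first-order shift for the notched square well of Figure 9.1. The notch is modeled as a constant lowering of the potential by `V0` over a width `w` centered in the well; these parameters, the choice of units, and the function names are assumptions made for illustration and do not come from the text.

```python
import math

def psi_ground(x, a):
    """Ground-state wave function of the infinite square well of width a."""
    return math.sqrt(2.0 / a) * math.sin(math.pi * x / a)

def first_order_shift(a, V0, w, steps=2000):
    """Estimate <psi|H1|psi> for a notch H1 = -V0 on |x - a/2| < w/2.

    Uses Simpson's rule; `steps` must be even.
    """
    lo, hi = a / 2 - w / 2, a / 2 + w / 2
    h = (hi - lo) / steps
    total = 0.0
    for k in range(steps + 1):
        x = lo + k * h
        weight = 1 if k in (0, steps) else (4 if k % 2 else 2)
        total += weight * (-V0) * psi_ground(x, a) ** 2
    return total * h / 3

def first_order_shift_exact(a, V0, w):
    """Closed form of the same integral, for comparison."""
    return -V0 * (w / a + math.sin(math.pi * w / a) / math.pi)

# Hypothetical notch: width 0.1 a, depth 0.01 (in units where a = 1).
print(first_order_shift(1.0, 0.01, 0.1), first_order_shift_exact(1.0, 0.01, 0.1))
```

Because the ground-state probability density peaks at the center of the well, the shift is roughly twice what the notch's share of the width alone would suggest.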

We can now calculate the second-order change in $E$, i.e., the term in Equation
(9.3) which is proportional to $\lambda^2$. Of course, it is reasonable to ask why we would
ever want to know this, since $\lambda E^{[1]}$ is much larger than $\lambda^2 E^{[2]}$. Normally, the
second order perturbation is irrelevant except for one very important case: if $\lambda E^{[1]}$
is exactly zero, then $\lambda^2 E^{[2]}$ gives the dominant change in the energy. To find
$\lambda^2 E^{[2]}$, we substitute both Equations (9.9) and (9.10) into Equation (9.8). When
$H_0$ operates on the sum of eigenvectors as in Equation (9.12), we get


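$\lambda^2\sum_i d_i E_i|\psi_i\rangle + \lambda^2 H'\sum_i c_i|\psi_i\rangle = \lambda^2 E^{[1]}\sum_i c_i|\psi_i\rangle + \lambda^2 E^{[2]}|\psi_n\rangle + \lambda^2 E_n\sum_i d_i|\psi_i\rangle$

As before, we take the inner product with $\langle\psi_n|$, and now we substitute $\langle\psi_n|H'|\psi_n\rangle$
for $E^{[1]}$. Solving for $\lambda^2 E^{[2]}$, we get

$\lambda^2 E^{[2]} = \lambda^2\sum_i c_i\langle\psi_n|H'|\psi_i\rangle - \lambda^2 c_n\langle\psi_n|H'|\psi_n\rangle$

The right-hand side is the sum over all eigenfunctions $|\psi_i\rangle$ minus the particular
eigenfunction $|\psi_n\rangle$, so it can be written as

$\lambda^2 E^{[2]} = \lambda^2\sum_{i \neq n} c_i\langle\psi_n|H'|\psi_i\rangle$ (9.16)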


Now all we have to do is to find the coefficients $c_i$ which first appeared in Equation
(9.9). We go back to Equation (9.13), but now instead of applying $\langle\psi_n|$ to both
sides, we apply an arbitrary eigenfunction $\langle\psi_m|$ where $\langle\psi_m| \neq \langle\psi_n|$. This gives

$\lambda\langle\psi_m|H'|\psi_n\rangle + \lambda E_m c_m = \lambda E_n c_m + 0$

so that

$c_m = \frac{\langle\psi_m|H'|\psi_n\rangle}{E_n - E_m}$ (9.17)


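Substituting this expression for the $c_i$’s in Equation (9.16) gives the final expression
for $\lambda^2 E^{[2]}$:

$\lambda^2 E^{[2]} = \sum_{i \neq n} \frac{\langle\psi_n|\lambda H'|\psi_i\rangle\langle\psi_i|\lambda H'|\psi_n\rangle}{E_n - E_i} = \sum_{i \neq n} \frac{|\langle\psi_i|\lambda H'|\psi_n\rangle|^2}{E_n - E_i}$ (9.18)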


To summarize: if we start out with some Hamiltonian $H_0$ for which we can
solve the Schrödinger equation exactly, and we begin in an eigenstate $|\psi_n\rangle$ with
energy $E_n$, then after we change the Hamiltonian by the small amount $\lambda H'$, the
dominant change in the energy, proportional to $\lambda$, will be given by Equation (9.15),
while the next largest change, proportional to $\lambda^2$, will be given by Equation (9.18).








While we have used $\lambda$ to remember which terms are larger than others, we can
now simplify our expressions by writing the change in $H$ as

$H = H_0 + H_1$

where

$H_1 = \lambda H'$

is very small compared to the unperturbed Hamiltonian $H_0$. Then we write the first
and second order changes in the energy as

$E^{(1)} = \lambda E^{[1]}$

and

$E^{(2)} = \lambda^2 E^{[2]}$

These changes in the energy are given by

$E^{(1)} = \langle\psi_n|H_1|\psi_n\rangle$ (9.19)

and

$E^{(2)} = \sum_{i \neq n} \frac{|\langle\psi_i|H_1|\psi_n\rangle|^2}{E_n - E_i}$ (9.20)

where the $|\psi_i\rangle$ which appear in Equation (9.20) are all of the other eigenfunctions
of $H_0$ aside from the one being perturbed. Equations (9.19) and (9.20) are the main
result of this section; they give the first-order and second-order perturbations to
the energy from an arbitrary perturbation to the Hamiltonian. As usual, the inner
products which appear in Equations (9.19) and (9.20) can represent a variety of
different mathematical possibilities. If the wave functions and the perturbation
are functions of position (as in Example 9.1), then these inner products will be
integrals. If the eigenstates are spin states and the perturbation is a function of spin
operators (as in Example 9.2), then these inner products will be matrix products.
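When the inner products are matrix elements, Equations (9.19) and (9.20) reduce to a few lines of arithmetic. The sketch below applies them to a hypothetical two-level system, with unperturbed energies $\mp 1$ and a real off-diagonal coupling `v`, and compares the result with the exact lower eigenvalue $-\sqrt{1 + v^2}$. The system, the numbers, and the function name are illustrative assumptions, not taken from the text.

```python
import math

def energy_corrections(E0, H1, n):
    """First- and second-order shifts of level n, following (9.19) and (9.20).

    E0 : list of unperturbed energies (assumed nondegenerate)
    H1 : perturbation matrix elements <psi_i|H1|psi_j> as a nested list
    """
    first = H1[n][n]
    second = sum(abs(H1[i][n]) ** 2 / (E0[n] - E0[i])
                 for i in range(len(E0)) if i != n)
    return first, second

# Hypothetical two-level system: H0 = diag(-1, +1) with small coupling v.
E0 = [-1.0, 1.0]
v = 0.05
H1 = [[0.0, v], [v, 0.0]]

first, second = energy_corrections(E0, H1, n=0)
perturbative = E0[0] + first + second
exact = -math.sqrt(1.0 + v ** 2)  # lower eigenvalue of [[-1, v], [v, 1]]
print(perturbative, exact)
```

Here the first-order shift vanishes, so the second-order term is the dominant change; the remaining discrepancy with the exact eigenvalue is of order $v^4$.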

There is one case in which the entire argument falls apart. If the original
eigenfunction $|\psi_n\rangle$ is degenerate with some other eigenfunction $|\psi_m\rangle$ of $H_0$, i.e.,
$E_n = E_m$, then the argument fails. This can be seen from the fact that both Equations
(9.17) and (9.20) “blow up” in this case with zero in the denominator. Note
that Equation (9.19) might be completely well behaved in this case, and it is a very
common mistake to use Equation (9.19) in the case of this kind of degeneracy.
DON’T DO IT! Since the second-order term in this case is infinite, the entire perturbation
expansion becomes inapplicable for the case of degeneracy. Perturbation
theory applied to degenerate eigenfunctions requires some further mathematical
machinery (degenerate perturbation theory) which is beyond the scope of this text.
Note, however, that there is one special case (which we will encounter frequently)
in which we can use Equations (9.19) and (9.20) with degenerate wave functions:
if $\langle\psi_n|H_1|\psi_i\rangle = 0$ whenever $E_n = E_i$, then our expressions will be well behaved.

Although the change in the energy is usually the quantity that can be most easily
measured directly, it is also possible to calculate the change in the wave function
due to the perturbation. Returning to Equation (9.4), we see that the lowest-order
change to $|\psi_n\rangle$ is given by $\lambda|\phi_1\rangle$, and $|\phi_1\rangle$ has already been expressed as a sum
over the unperturbed eigenfunctions in Equation (9.9) with the $c_i$’s that appear in
this equation given by Equation (9.17). Substituting these values for the $c_i$’s from
Equation (9.17) into Equation (9.9), we obtain

$\lambda|\phi_1\rangle = \sum_{i \neq n} \frac{\langle\psi_i|\lambda H'|\psi_n\rangle}{E_n - E_i}|\psi_i\rangle = \sum_{i \neq n} \frac{\langle\psi_i|H_1|\psi_n\rangle}{E_n - E_i}|\psi_i\rangle$

Note that we have dropped the $i = n$ term from the sum in Equation (9.9). We
have the freedom to do this because this term is just proportional to the original
unperturbed eigenfunction $|\psi_n\rangle$. Hence, in the original expansion of the wave
function (Equation 9.4), this term can be removed from the $\lambda|\phi_1\rangle$ term and absorbed
into the $|\psi_n\rangle$ term. Then to first order, the new wave function in the presence of
the perturbation is

$|\psi\rangle = |\psi_n\rangle + \sum_{i \neq n} \frac{\langle\psi_i|H_1|\psi_n\rangle}{E_n - E_i}|\psi_i\rangle$

Thus, the effect of the perturbation is to “mix together” all of the other eigenfunctions
in the new perturbed wave function.
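The mixing coefficients can be checked in the same spirit. The sketch below computes the first-order coefficients $\langle\psi_i|H_1|\psi_n\rangle/(E_n - E_i)$ for a hypothetical two-level system (unperturbed energies $\mp 1$, real coupling `v`) and compares the admixture of the upper state with the exact eigenvector. The system and the function name are assumptions chosen for illustration.

```python
import math

def first_order_state(E0, H1, n):
    """Coefficients of the perturbed state in the unperturbed basis, to first order.

    The coefficient of |psi_n> itself is kept at 1; every other |psi_i>
    enters with weight <psi_i|H1|psi_n> / (E_n - E_i).
    """
    return [1.0 if i == n else H1[i][n] / (E0[n] - E0[i])
            for i in range(len(E0))]

# Hypothetical two-level system: H0 = diag(-1, +1) with small coupling v.
E0 = [-1.0, 1.0]
v = 0.05
H1 = [[0.0, v], [v, 0.0]]

coeffs = first_order_state(E0, H1, n=0)

# Exact eigenvector of [[-1, v], [v, 1]] for the lower eigenvalue E:
# the first row of (H - E) c = 0 gives c_1 / c_0 = (1 + E) / v.
E_exact = -math.sqrt(1.0 + v ** 2)
exact_ratio = (1.0 + E_exact) / v
print(coeffs[1], exact_ratio)
```

The two numbers agree to the expected accuracy; the difference is of order $v^3$.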


Example 9.1. The Anharmonic Oscillator.

In Chapter 4, we derived the solutions for the one-dimensional harmonic oscillator
potential,

$V(x) = \frac{1}{2}Kx^2$

The energies are

$E = \left(n + \frac{1}{2}\right)\hbar\omega, \quad n = 0, 1, \ldots$




FIGURE 9.2 Solid curve: the unperturbed harmonic oscillator potential $V = (1/2)Kx^2$.
Dashed curve: the perturbed potential $V = (1/2)Kx^2 + \beta x^4$.


where $\omega = \sqrt{K/m}$ is the classical oscillation frequency, and the corresponding
wave function for $n = 1$ is

$\psi_1 = \frac{\sqrt{2}}{\pi^{1/4}}\, s\, e^{-s^2/2}$

with $s = [(Km)^{1/4}/\hbar^{1/2}]x$.

Suppose we begin in the first excited state $\psi_1$ with energy $E = (3/2)\hbar\omega$. We
will calculate what happens to the energy of this state if we add a small anharmonic
term to the potential (Figure 9.2):

$V(x) = \frac{1}{2}Kx^2 + \beta x^4$

From Equation (9.19), the first-order perturbation is

$E^{(1)} = \langle n = 1|\beta x^4|n = 1\rangle$


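Expressing the perturbation in terms of $s$, we get

$E^{(1)} = \int_{-\infty}^{\infty} \left[\frac{\sqrt{2}}{\pi^{1/4}}\, s\, e^{-s^2/2}\right] \beta\frac{\hbar^2}{Km}s^4 \left[\frac{\sqrt{2}}{\pi^{1/4}}\, s\, e^{-s^2/2}\right] ds$

$= \frac{2}{\sqrt{\pi}}\,\beta\frac{\hbar^2}{Km}\int_{-\infty}^{\infty} s^6 e^{-s^2}\, ds$

$= \frac{15}{4}\,\beta\frac{\hbar^2}{Km}$

With $\omega = \sqrt{K/m}$, the total energy including the perturbation is

$E = \hbar\omega\left(\frac{3}{2} + \frac{15}{4}\beta\frac{\hbar\omega}{K^2}\right)$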
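The numerical factor in this example can be checked independently. Since $x^4 = s^4\hbar^2/(Km)$, the first-order shift is $\beta\hbar^2/(Km)$ times the expectation value of $s^4$ in the first excited state. The Python sketch below evaluates that expectation value by Simpson's rule, assuming the $n = 1$ oscillator wave function $(\sqrt{2}/\pi^{1/4})\, s\, e^{-s^2/2}$ in the dimensionless variable $s$; the cutoff `smax` and the step count are arbitrary numerical choices.

```python
import math

def psi1(s):
    """First excited state of the oscillator in the dimensionless variable s
    (assumed form, normalized with respect to s)."""
    return math.sqrt(2.0) / math.pi ** 0.25 * s * math.exp(-s * s / 2.0)

def expectation_s4(smax=10.0, steps=20000):
    """Simpson's-rule estimate of the integral of psi1^2 * s^4 over [-smax, smax]."""
    h = 2.0 * smax / steps  # steps must be even
    total = 0.0
    for k in range(steps + 1):
        s = -smax + k * h
        weight = 1 if k in (0, steps) else (4 if k % 2 else 2)
        total += weight * psi1(s) ** 2 * s ** 4
    return total * h / 3

print(expectation_s4())  # should be close to 15/4
```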


Example 9.1 shows how to apply perturbation theory when the wave function 
is a function of position. However, perturbation theory can also be applied to the 
matrix representation of spin states. 


Example 9.2. Spins in a Magnetic Field.

Suppose we begin with an electron having spin magnetic moment $\boldsymbol{\mu}_s$ in a strong
magnetic field $B_z\hat{z}$ in the $z$ direction (Figure 9.3). Recall from Chapter 8 that the
potential for an electron with spin magnetic moment $\boldsymbol{\mu}_s$ in a magnetic field $\mathbf{B}$ is

$V = -\boldsymbol{\mu}_s \cdot \mathbf{B}$

where

$\boldsymbol{\mu}_s = -\frac{g_s\mu_B}{\hbar}\mathbf{S}$

Hence, the Hamiltonian is

$H_0 = \frac{g_s\mu_B}{\hbar}\mathbf{B}\cdot\mathbf{S} = \frac{g_s}{2}\mu_B\mathbf{B}\cdot\boldsymbol{\sigma}$

and $g_s = 2$ for the electron. The eigenstates of $H_0$ are just the spin up and spin
down states, $|\uparrow\rangle$ and $|\downarrow\rangle$, with energies $E_+ = +\mu_B B_z$ and $E_- = -\mu_B B_z$.

Suppose we are in the spin up state, $|\uparrow\rangle$, and we add the small magnetic field
$B_x\hat{x}$ with $B_x \ll B_z$ (Figure 9.3). What is the change in the energy of the electron
due to this new magnetic field? Our perturbing potential is






FIGURE 9.3 We begin with an electron in the spin up direction in the magnetic field $B_z\hat{z}$,
and we add the small perturbation $B_x\hat{x}$.




9.2 Perturbations to the Atomic Energy Levels 




FIGURE 9.4 A given spectral line corresponds to a single transition between two different 
energy levels. If two supposedly degenerate levels are slightly separated in energy, a double 
line will be produced. 


physics, the elegant theory is extremely accurate, but it is not exact. There are 
small corrections to the theory due to internal interactions in the hydrogen atom. 
With perturbation theory, we now have the tools to derive these corrections. 

Fine Structure 

Recall that for hydrogen, the energy is given by

$E_n = (-13.6\ \text{eV})\frac{1}{n^2}$

In hot hydrogen gas, a series of spectral lines are observed, each one corresponding
to a particular transition with $h\nu = E_{n_1} - E_{n_2}$. However, upon examining the
spectrum closely, it is observed that some spectral lines are not really single lines
but are closely spaced double lines. This feature is called fine structure. There is
an obvious way to get such closely spaced spectral lines: they will be observed if
the degenerate energy levels are not truly degenerate but separated in energy by a
very small amount (Figure 9.4). Recall that a given energy level $E_n$ corresponds
to $2n^2$ different states. Apparently, some sort of interaction, which we have not yet
accounted for, splits some of these states apart in energy.

Our simple model for the hydrogen atom considered only the Coulomb interaction
between the proton and the electron. What we have neglected are the various
magnetic fields produced in the atom. The orbital motion of the electron sets up a
magnetic field with magnetic moment given by

$\boldsymbol{\mu}_l = -\frac{g_l e}{2m_e}\mathbf{L}, \quad g_l = 1$ (9.21)

while the spin of the electron produces a spin magnetic moment equal to

$\boldsymbol{\mu}_s = -\frac{g_s e}{2m_e}\mathbf{S}, \quad g_s \approx 2$

So the orbital motion of the electron produces a magnetic field, and the electron
itself acts like a small magnet embedded in this magnetic field (Figure 9.5). The
electron will prefer to line up with its spin magnetic field in the opposite direction
to the orbital magnetic field. Hence, the “spin up” and “spin down” states of the
electron will have different energies. This is the basis of fine structure; it arises
from the spin-orbit coupling of the electron.






FIGURE 9.5 The fine-structure splitting is produced by the interaction between the spin 
magnetic field of the electron and the magnetic field produced by its orbital motion. 


More precisely, the interaction energy between the spin magnetic moment $\boldsymbol{\mu}_s$
and the external magnetic field $\mathbf{B}$ (produced by the orbital motion of the electron)
is

$H_1 = -\boldsymbol{\mu}_s \cdot \mathbf{B}$ (9.22)

To find $\mathbf{B}$, imagine that we are sitting in the rest frame of the electron watching the
proton orbit around us. In the rest frame of the electron, the electric field of the
proton,

$\mathbf{E} = \frac{1}{4\pi\epsilon_0}\frac{e}{r^3}\mathbf{r}$

transforms into a magnetic field, $\mathbf{B} = -\mathbf{v} \times \mathbf{E}/c^2$. Using $\mathbf{L} = \mathbf{r} \times \mathbf{p}$, we obtain,
for the magnetic field induced by the orbital motion of the electron,

$\mathbf{B} = \frac{1}{4\pi\epsilon_0}\frac{e}{m_e c^2 r^3}\mathbf{L}$

Substituting our expressions for $\boldsymbol{\mu}_s$ and $\mathbf{B}$ into Equation (9.22), we obtain, for the
energy of the spin-orbit interaction,

$H_1 = \frac{e^2}{4\pi\epsilon_0 m_e^2 c^2 r^3}\mathbf{S}\cdot\mathbf{L}$

Unfortunately, this expression is wrong. The problem arises because we did the
calculation in the rest frame of the electron, which is an accelerating frame of
reference, so we cannot simply transform back into the rest frame of the proton
and expect to get the right answer. When this error is corrected, an additional factor
of 1/2 is obtained (this correction is called Thomas precession after L.H. Thomas,
who explained this effect in 1926). The corrected expression is

$H_1 = \frac{e^2}{8\pi\epsilon_0 m_e^2 c^2 r^3}\mathbf{S}\cdot\mathbf{L}$ (9.23)




FIGURE 9.6 The spin-orbit interaction Hamiltonian is proportional to $\mathbf{S}\cdot\mathbf{L}$. Classically,
$\mathbf{S}\cdot\mathbf{L} = SL\cos\theta$.


From a classical point of view, this expression makes sense, since we expect the
interaction energy of the two magnetic fields to depend on the alignment of the $\mathbf{S}$
and $\mathbf{L}$ vectors (Figure 9.6).

We can get a crude order of magnitude estimate of the size of this interaction
energy by taking $S \sim \hbar$, $L \sim \hbar$, and $r \sim 10^{-10}$ m. Then Equation (9.23) gives a
perturbation energy of the order of $10^{-4}$ eV compared to a hydrogen binding energy
of 13.6 eV. It is therefore a good approximation to treat the spin-orbit interaction
as a small perturbation.
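This estimate is easy to reproduce. The Python sketch below evaluates $e^2\hbar^2/(8\pi\epsilon_0 m_e^2 c^2 r^3)$, i.e., the spin-orbit Hamiltonian with its Thomas factor of 1/2 and with $\mathbf{S}\cdot\mathbf{L}$ replaced by $\hbar^2$, at $r = 10^{-10}$ m. The constants are standard SI values entered by hand, and the function name is hypothetical.

```python
import math

# Physical constants in SI units (standard values, entered by hand).
E_CHARGE = 1.602176634e-19   # elementary charge, C
HBAR = 1.054571817e-34       # reduced Planck constant, J s
M_E = 9.1093837015e-31       # electron mass, kg
C = 2.99792458e8             # speed of light, m/s
EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m

def spin_orbit_scale_eV(r=1e-10):
    """Rough size of the spin-orbit energy with S ~ L ~ hbar, in eV."""
    joules = (E_CHARGE ** 2 * HBAR ** 2
              / (8 * math.pi * EPS0 * M_E ** 2 * C ** 2 * r ** 3))
    return joules / E_CHARGE

print(spin_orbit_scale_eV())  # of order 1e-4 eV, tiny compared with 13.6 eV
```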

In our expression for $H_1$, the interaction is proportional to $\mathbf{S}\cdot\mathbf{L} = S_xL_x +
S_yL_y + S_zL_z$. However, this expansion is essentially useless, since the electron
is never in a state which is a simultaneous eigenstate of all three components of
angular momentum. Instead, we use the standard procedure from Chapter 8 for
dealing with dot products of operators. We define the total angular momentum
operator $\mathbf{J}$ to be

$\mathbf{J} = \mathbf{L} + \mathbf{S}$

which implies

$\mathbf{S}\cdot\mathbf{L} = \frac{1}{2}(J^2 - L^2 - S^2)$

and the spin-orbit perturbation becomes

$H_1 = \frac{e^2}{16\pi\epsilon_0 m_e^2 c^2 r^3}(J^2 - L^2 - S^2)$

In Chapter 6 we wrote the hydrogen wave functions in terms of the quantum
numbers $n$, $l$, and $m_l$, and in Chapter 8 we added the spin quantum number $m_s$,
but now we want to express the hydrogen wave functions as eigenfunctions of $J^2$
and $J_z$, i.e., in the form $|n\ l\ j\ m_j\rangle$, where

$J^2|n\ l\ j\ m_j\rangle = \hbar^2 j(j + 1)|n\ l\ j\ m_j\rangle$

$J_z|n\ l\ j\ m_j\rangle = \hbar m_j|n\ l\ j\ m_j\rangle$

Then the perturbation to the energy levels due to spin-orbit coupling is

$E^{(1)}_{\text{spin-orbit}} = \langle n\ l\ j\ m_j|H_1|n\ l\ j\ m_j\rangle$

$= \langle n\ l\ j\ m_j|\frac{e^2}{16\pi\epsilon_0 m_e^2 c^2 r^3}(J^2 - L^2 - S^2)|n\ l\ j\ m_j\rangle$

$= \hbar^2[j(j + 1) - l(l + 1) - s(s + 1)]\,\langle n\ l\ j\ m_j|\frac{e^2}{16\pi\epsilon_0 m_e^2 c^2 r^3}|n\ l\ j\ m_j\rangle$

What is the value of $\langle n\ l\ j\ m_j|(e^2/16\pi\epsilon_0 m_e^2 c^2 r^3)|n\ l\ j\ m_j\rangle$? Note that this is
just an integral over the radial wave function, which is a function only of $n$ and $l$;
hence, we can write

$\langle n\ l\ j\ m_j|(e^2/16\pi\epsilon_0 m_e^2 c^2 r^3)|n\ l\ j\ m_j\rangle = f_{nl}$

where $f_{nl}$ is a function only of $n$ and $l$. Then our expression for the change in $E$
due to the spin-orbit interaction (taking $s = 1/2$ for the electron) becomes

$E^{(1)}_{\text{spin-orbit}} = \hbar^2 f_{nl}\left[j(j + 1) - l(l + 1) - \frac{3}{4}\right]$ (9.24)

From Chapter 8 recall that for $l \neq 0$, $j$ has two possible values, either $l - 1/2$
or $l + 1/2$, while for $l = 0$ we have only $j = 1/2$. From Equation (9.24), it is clear
that the spin-orbit coupling splits each state with $l \neq 0$ into two different states with
$j = l + 1/2$ having the higher energy and $j = l - 1/2$ having the lower energy.
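The sign of the splitting follows from the factor $j(j + 1) - l(l + 1) - 3/4$ alone. The short Python sketch below evaluates this factor exactly with rational arithmetic for the two allowed values of $j$; the function name is hypothetical.

```python
from fractions import Fraction

def spin_orbit_factor(l, j):
    """The factor j(j+1) - l(l+1) - 3/4 for an electron (s = 1/2)."""
    l, j = Fraction(l), Fraction(j)
    return j * (j + 1) - l * (l + 1) - Fraction(3, 4)

for l in range(1, 4):
    upper = spin_orbit_factor(l, Fraction(2 * l + 1, 2))  # j = l + 1/2
    lower = spin_orbit_factor(l, Fraction(2 * l - 1, 2))  # j = l - 1/2
    print(l, upper, lower)
```

For every $l \geq 1$ the factor equals $l$ for $j = l + 1/2$ and $-(l + 1)$ for $j = l - 1/2$, so the $j = l + 1/2$ state is raised and the $j = l - 1/2$ state is lowered. For $l = 0$, $j = 1/2$ the factor vanishes.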

The expression for $f_{nl}$ can be evaluated to yield a final expression for the change
in $E$ from spin-orbit coupling:



(9.25) 





where

$\alpha = \frac{e^2}{4\pi\epsilon_0\hbar c}$


Note that $\alpha$ is a dimensionless number with a value of roughly 1/137. Because
of its origin in this calculation, $\alpha$ is called the fine-structure constant, although it
crops up in many other areas of physics. Recall that the hydrogen energy levels
$E_n$ are all negative, so we take the absolute value of $E_n$ in Equation (9.25) and
in Equations (9.28)-(9.29) to avoid any confusion over the sign of the change in
energy.
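The quoted value of the fine-structure constant can be verified directly from $\alpha = e^2/(4\pi\epsilon_0\hbar c)$. The Python sketch below uses standard SI values entered by hand; the variable names are arbitrary.

```python
import math

# Physical constants in SI units (standard values, entered by hand).
E_CHARGE = 1.602176634e-19   # elementary charge, C
HBAR = 1.054571817e-34       # reduced Planck constant, J s
C = 2.99792458e8             # speed of light, m/s
EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m

alpha = E_CHARGE ** 2 / (4 * math.pi * EPS0 * HBAR * C)

print(1 / alpha)     # close to 137
print(alpha ** 2)    # sets the relative size of the fine-structure corrections
```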

However, this is not the full story of the fine-structure splitting. We have applied
our standard nonrelativistic treatment to the electron, and this is an excellent approximation,
since the electron in the hydrogen atom is not highly relativistic (its
kinetic energy is much smaller than its rest energy; classically, this corresponds to
an electron velocity $v$ much smaller than the speed of light). However, now that
we are working in the realm of tiny changes in the energy levels, we have to take
into account small corrections due to relativistic effects.

In relativistic classical mechanics, the total energy of a particle with rest mass
$m$ and momentum $p$ is

$E = \sqrt{p^2c^2 + m^2c^4}$ (9.26)

For now we are interested only in the case where the particle is only slightly
relativistic so that $p \ll mc$. (We will relax this restriction in Chapter 15 when we
discuss relativistic quantum mechanics in more detail.) In this limit we can expand
the square root in Equation (9.26) to obtain

$E = mc^2 + \frac{p^2}{2m} - \frac{p^4}{8m^3c^2} + \cdots$ (9.27)

In the limit of small $p$, each term in this expression is small compared to the
preceding one. In relativity, the first term in Equation (9.27) is interpreted as the
rest energy of the particle, while the remainder of the expression corresponds to
the energy of motion. But what is the correct energy to use in the Schrödinger
equation? In the standard nonrelativistic Schrödinger equation, the Hamiltonian
operator corresponds to the kinetic energy plus the potential energy, and the rest
energy plays no role. Hence, in writing the Hamiltonian, we use the second term
in Equation (9.27) to give us the unperturbed Hamiltonian, while the third term gives
the lowest-order perturbation due to relativistic effects.
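A quick numerical comparison shows how good the truncated expansion is for a slightly relativistic particle. The Python sketch below works in units where $m = c = 1$ (a choice made for convenience) and compares the exact energy of Equation (9.26) with the first three terms of Equation (9.27); the function names are hypothetical.

```python
import math

def exact_energy(p, m=1.0, c=1.0):
    """Total relativistic energy, Equation (9.26)."""
    return math.sqrt(p * p * c * c + m * m * c ** 4)

def series_energy(p, m=1.0, c=1.0):
    """Rest energy + kinetic energy + lowest-order relativistic correction."""
    return m * c ** 2 + p ** 2 / (2 * m) - p ** 4 / (8 * m ** 3 * c ** 2)

p = 0.1  # p << mc
print(exact_energy(p), series_energy(p))
```

The remaining difference is set by the next term in the series, which is of order $p^6/(m^5c^4)$.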

Then we have, for our perturbation,

$H_1 = -\frac{p^4}{8m^3c^2}$

and the lowest-order change in the hydrogen energy levels is

$E^{(1)}_{\text{relativistic}} = \langle n\ l\ j\ m_j|H_1|n\ l\ j\ m_j\rangle$

This expression can be evaluated for the hydrogen wave functions, yielding a final
result of

(9.28)

Since $E^{(1)}_{\text{relativistic}}$ is a function of $n$ and $l$ but not a function of $j$, the relativistic
correction does not contribute anything to the splitting of the energy levels, which
is determined entirely by the spin-orbit interaction, but it does change the overall
dependence of the energy levels on $n$ and $l$. Since $l \leq n - 1$, the term in brackets in
Equation (9.28) is always positive, so the relativistic correction always decreases
the energy levels.

Note that $E^{(1)}_{\text{spin-orbit}}$ and $E^{(1)}_{\text{relativistic}}$ are roughly equal in magnitude; both of them
are approximately $\alpha^2 E_n$. Hence, neither contribution to the fine structure can be
neglected. We therefore add Equations (9.25) and (9.28) to get the total change in
energy due to both the spin-orbit coupling and relativistic effects:



The dependence on $l$ has cancelled out, so that the total change in energy is a
function only of $j$ and $n$. The net effect of the fine structure is to split the $l - 1/2$ and
$l + 1/2$ states (due to the spin-orbit coupling) and to decrease the energies of both
states relative to the unperturbed hydrogen energy levels (see Exercise 9.10). The
fine-structure perturbation (both the decrease in energy relative to the unperturbed
energy levels and the splitting between the $j = l + 1/2$ and the $j = l - 1/2$ states)
is of order $\alpha^2 E_n \sim 10^{-4} E_n$, since the other factors in Equation (9.29) are all of
order unity.
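The cancellation of the $l$ dependence can be checked with exact rational arithmetic. The Python sketch below uses the standard closed forms for the two corrections, written in units of $\alpha^2|E_n|$: a spin-orbit shift $[j(j+1) - l(l+1) - 3/4]/[2nl(l + 1/2)(l + 1)]$ and a relativistic shift $-(1/n^2)[n/(l + 1/2) - 3/4]$. Treating these as the content of Equations (9.25) and (9.28) is an assumption; the forms are the usual textbook results rather than expressions copied from this section, and the check is restricted to $l \geq 1$.

```python
from fractions import Fraction

HALF = Fraction(1, 2)

def spin_orbit(n, l, j):
    """Spin-orbit shift in units of alpha^2 |E_n| (assumed standard form, l >= 1)."""
    bracket = j * (j + 1) - l * (l + 1) - Fraction(3, 4)
    return bracket / (2 * n * l * (l + HALF) * (l + 1))

def relativistic(n, l):
    """Relativistic shift in units of alpha^2 |E_n| (assumed standard form)."""
    return -(Fraction(n) / (l + HALF) - Fraction(3, 4)) / n ** 2

def fine_structure(n, j):
    """Total shift, which should depend only on n and j."""
    return -(Fraction(n) / (j + HALF) - Fraction(3, 4)) / n ** 2

for n in range(2, 5):
    for l in range(1, n):
        for j in (l - HALF, l + HALF):
            total = spin_orbit(n, l, j) + relativistic(n, l)
            print(n, l, j, total, total == fine_structure(n, j))
```

Every combination prints `True`, and every total is negative, in line with the statement that both states are lowered relative to the unperturbed levels.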

We now introduce a standard if somewhat arcane notation to describe the angular
momentum states of the hydrogen atom. In this notation, the different $l$ states of
the hydrogen atom are written with different capital letters: the $l = 0$ state is
called the S state, the $l = 1$ state is called the P state, the $l = 2$ state is called
the D state, the $l = 3$ state is called the F state, and then the sequence continues
alphabetically (G, H, I, ...). (The origin of these abbreviations is buried in the
history of spectroscopy; there is nothing particularly logical about them.) The
standard way of writing the various $j$ states is to indicate the value of $j$ as a
subscript, e.g., $P_{1/2}$ is the notation for $l = 1$, $j = 1/2$. (We will see an additional
twist to this notation in Chapter 13.) The different hydrogen energy levels written
in this notation are shown in Figure 9.7. The S states ($l = 0$) do not split since they
correspond to a single value of $j$, while the $l \neq 0$ states split into the $j = l + 1/2$
state and the $j = l - 1/2$ state, with the former having higher energy than the
latter.

FIGURE 9.7 The energy levels of hydrogen showing the fine structure (not drawn to
scale).

FIGURE 9.8 The spin-spin interaction between the proton and electron produces the
hyperfine splitting.

Finally, note that we have blithely applied first-order perturbation theory 
to degenerate states, ignoring the warning in the previous section. However, 
it is all right to use nondegenerate perturbation theory in this case, since 
$\langle n\,l\,m_l\,m_s|H_1|n\,l'\,m_l'\,m_s'\rangle = 0$ if $l \ne l'$ or $m_l \ne m_l'$ or $m_s \ne m_s'$. 

There is a second internal magnetic interaction in the hydrogen atom with a 
much smaller effect. The proton has a spin magnetic moment given by 

$$\boldsymbol{\mu}_p = \frac{g_p e}{2 m_p}\mathbf{S} \tag{9.30}$$

with $g_p \approx 5.6$. So the electron also feels this magnetic field and is perturbed by 
it (Figure 9.8). However, a comparison of Equation (9.30) with Equation (9.21) 
shows that the ratio of the spin magnetic field of the proton to the magnetic field 
produced by the orbital motion of the electron is roughly $m_e/m_p \sim 5 \times 10^{-4}$. 
Hence, we expect the splitting from the spin-spin interaction to be much smaller 
than the effect of the spin-orbit interaction. 








Nonetheless, this spin-spin interaction does produce a splitting in energies. 
Since it is so much smaller than the fine structure, it is called hyperfine splitting. 
In the ground state of hydrogen, for example, the triplet ($S = 1$) state has a higher 
energy than the singlet ($S = 0$) state: the energy difference is $\Delta E = 5.9 \times 10^{-6}$ eV. 
Although this energy difference is tiny, hyperfine splitting has an importance out of 
proportion to its magnitude. The universe contains clouds of neutral hydrogen gas; 
this gas radiates by dropping from the triplet into the singlet state. The frequency 
of this radiation is $\nu = \Delta E/h = 1420$ MHz, corresponding to a wavelength of 
$\lambda = 21$ cm: the famous “21-centimeter line.” 
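
The conversion from the hyperfine energy difference to the frequency and wavelength of 
the emitted radiation can be checked with a few lines of arithmetic. The sketch below uses 
$\nu = \Delta E/h$ and $\lambda = c/\nu$; the function name is chosen only for this example, and the 
constants are standard values rather than anything specific to the text. 

```python
H_EV_S = 4.135667696e-15  # Planck constant in eV*s
C_M_S = 2.99792458e8      # speed of light in m/s


def line_from_energy(delta_e_ev):
    """Return (frequency in Hz, wavelength in m) for a photon of energy delta_e_ev."""
    frequency_hz = delta_e_ev / H_EV_S
    wavelength_m = C_M_S / frequency_hz
    return frequency_hz, wavelength_m


if __name__ == "__main__":
    nu, lam = line_from_energy(5.9e-6)  # hyperfine splitting quoted in the text
    print(f"frequency  = {nu / 1e6:.0f} MHz")
    print(f"wavelength = {lam * 100:.1f} cm")
```

With the rounded value $\Delta E = 5.9 \times 10^{-6}$ eV the result lands within about half a 
percent of the quoted 1420 MHz and at a wavelength of roughly 21 cm. 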


The Lamb Shift 

The fine-structure calculations in the previous section predict that the hydrogen 
energy levels do not depend on $l$. Hence, two states with the same $n$ and $j$ quantum 
numbers but different values of $l$ should be degenerate in energy. In 1947 Willis 
Lamb and his student, R.C. Retherford, showed experimentally that this was not 
the case. Specifically, they measured a splitting between the $n = 2$, $S_{1/2}$ state and 
the $n = 2$, $P_{1/2}$ state. 

This splitting, now called the Lamb shift, cannot be explained in the context 
of quantum mechanics, but arises from the more esoteric area of quantum field 
theory (which was, in part, motivated by Lamb’s experimental result). Quantum 
field theory is beyond the scope of this book; here we will simply use one of the 
predictions of the theory. 

In quantum field theory, the vacuum is no longer simply empty space; it is 
literally seething with activity. Virtual particles, such as electron-positron pairs, 
can pop into existence and disappear. As long as the energy of the particles $E$ and 
their lifetime $t$ satisfy $Et < \hbar/2$, then these particle-antiparticle pairs cannot be 
detected directly. [This is a rather crude explanation, which is made much more 
precise within the framework of quantum field theory.] 
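
To get a feeling for the time scale implied by $Et < \hbar/2$, the sketch below evaluates the 
bound $t < \hbar/(2E)$ for a virtual electron-positron pair. Taking $E$ to be the rest energy 
of the pair, $2m_ec^2$, is an assumption made only for this estimate, in the same crude spirit 
as the explanation above. 

```python
HBAR_EV_S = 6.582119569e-16       # reduced Planck constant in eV*s
ELECTRON_REST_EV = 0.51099895e6   # electron rest energy m_e c^2 in eV


def max_lifetime_s(energy_ev):
    """Upper bound on the lifetime t implied by E * t < hbar / 2."""
    return HBAR_EV_S / (2.0 * energy_ev)


if __name__ == "__main__":
    pair_energy = 2.0 * ELECTRON_REST_EV  # assumed energy of the virtual pair
    print(f"t < {max_lifetime_s(pair_energy):.2e} s")
```

The bound comes out at a few times $10^{-22}$ s, which gives a sense of why such pairs 
cannot be observed directly. 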

These particle-antiparticle pairs produce an effect called vacuum polarization. 
Consider a dielectric surrounding a point positive charge. The point charge polarizes 
the dielectric, attracting negative charge inward and repelling positive charge 
outward. This tends to cancel the electric field produced by the point charge, leading 
to a reduced electric field inside the dielectric (Figure 9.9). 

Now consider the same positive charge in a vacuum. The production of virtual 
electron-positron pairs tends to cancel the charge, just as in a physical dielectric. 
However, unlike a dielectric, we can never remove the positive charge from the 
vacuum polarization to measure its true charge: the charge we measure has already 
been cancelled by the effect of the vacuum polarization. This means that the “bare” 
charge, which cannot be measured directly, is much larger than we thought; in fact, 
it is mathematically infinite! 

FIGURE 9.9 In a dielectric, polarization reduces the electric field produced by a point 
charge. Vacuum polarization produces the same effect in a vacuum. 

The upshot of all of this is that the electric field of a point charge must be 
modified at the origin (where the charge is “infinite”), but everywhere else in 
space the charge has already been cancelled by the effects of vacuum polarization, 
so the electric field is unchanged. The result for $V(r)$, derived from quantum field 
theory, is 

$$V(r) = -\frac{e^2}{4\pi\epsilon_0}\frac{1}{r} - \frac{\alpha e^2\hbar^2}{15\pi\epsilon_0 m_e^2 c^2}\,\delta^3(\mathbf{r}) \tag{9.31}$$


where $\delta^3(\mathbf{r})$ is the three-dimensional Dirac delta function discussed in Chapter 7. 
The second term in Equation (9.31) is the perturbation to the Hamiltonian, so the 
first-order shift in the energy is 

$$E^{(1)} = \langle\psi|H_1|\psi\rangle = -\frac{\alpha e^2\hbar^2}{15\pi\epsilon_0 m_e^2 c^2}\,|\psi(0)|^2$$

Recall that the hydrogen wave functions are all identically zero at the origin, except 
for the $l = 0$ states. Thus, the effect of the Lamb shift is to reduce the energy of the 
$l = 0$ states relative to the corresponding $l \ne 0$ states. The effect is smaller than 
the fine-structure splitting; e.g., for the $n = 2$ states, the splitting between the $l = 0$ 
and $l = 1$ state is about $10^{-7}$ eV. As bizarre as all of this sounds, it is important to 
remember that it is based on solid experimental evidence. 
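
The quoted size of the shift can be checked numerically. The sketch below assumes that the 
delta-function term of Equation (9.31) has the coefficient 
$\alpha e^2\hbar^2/(15\pi\epsilon_0 m_e^2c^2)$ with a negative sign, and uses the standard result 
$|\psi_{n00}(0)|^2 = 1/(\pi n^3 a_0^3)$ for the $l = 0$ hydrogen states. Both the coefficient and 
the function name are assumptions of this example. 

```python
import math

# SI constants (CODATA values)
E_CHARGE = 1.602176634e-19   # elementary charge, C
HBAR = 1.054571817e-34       # reduced Planck constant, J*s
EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m
M_E = 9.1093837015e-31       # electron mass, kg
C = 2.99792458e8             # speed of light, m/s

ALPHA = E_CHARGE**2 / (4 * math.pi * EPS0 * HBAR * C)     # fine-structure constant
A0 = 4 * math.pi * EPS0 * HBAR**2 / (M_E * E_CHARGE**2)   # Bohr radius, m


def vacuum_polarization_shift_ev(n):
    """First-order shift of the l = 0 state with principal quantum number n, in eV.

    Assumes H_1 = -coeff * delta^3(r) with the coefficient given below.
    """
    coeff = ALPHA * E_CHARGE**2 * HBAR**2 / (15 * math.pi * EPS0 * M_E**2 * C**2)
    psi0_squared = 1.0 / (math.pi * n**3 * A0**3)  # |psi_n00(0)|^2
    return -coeff * psi0_squared / E_CHARGE        # convert J to eV


if __name__ == "__main__":
    print(f"n = 2, l = 0 shift: {vacuum_polarization_shift_ev(2):.2e} eV")
```

Under these assumptions the $n = 2$, $l = 0$ level is lowered by roughly $10^{-7}$ eV, which matches 
the order of magnitude stated above. 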







9.3 ■ THE ATOM IN EXTERNAL ELECTRIC OR MAGNETIC FIELDS 

In the previous section, we discussed perturbations which are intrinsic to the atom. 
We will now examine what happens when the atom is placed in an external 
electromagnetic field. Since the atom consists of charged particles, and the electrons 
produce both a spin and orbital magnetic moment, any external electric or magnetic 
field will perturb the energy levels of the atom. The effect produced by an external 
electric field is called the Stark effect, while the effect of an external magnetic field 
is the Zeeman effect. 

The Atom in an Electric Field: The Stark Effect 

We will first examine the effect of a uniform electric field with magnitude $\mathcal{E}$ on 
the ground state of hydrogen. Recall that the ground-state wave function is 

$$\psi_{100} = \frac{1}{\sqrt{\pi a_0^3}}\,e^{-r/a_0}$$

where the “100” subscript denotes the $n\,l\,m_l$ quantum numbers. We can ignore the 
spin state of the electron, since the spin interacts only with magnetic fields through 
the electron’s spin magnetic moment. (Of course, we will have to consider spin in 
the next section when we discuss external magnetic fields.) We take the electric 
field to be uniform, static, and pointing in the z direction. 

Since the ground state of hydrogen is nondegenerate, we can use the perturbation 
theory expressions from Section 9.1. Classically, the potential energy of a charge 
$-e$ in an electric field $\mathcal{E}$ is $V = e\mathcal{E}z$, so the perturbation $H_1$ produced by the electric 
field is 

$$H_1 = e\mathcal{E}z$$

and the first-order change in the energy of the hydrogen atom is, from Equation 
(9.19), 

$$E^{(1)} = \langle\psi_{100}|e\mathcal{E}z|\psi_{100}\rangle \tag{9.32}$$

Taking $z = r\cos\theta$ and writing Equation (9.32) in spherical coordinates, we get 

$$E^{(1)} = \frac{e\mathcal{E}}{\pi a_0^3}\int_0^\infty r^3 e^{-2r/a_0}\,dr\int_0^\pi \cos\theta\,\sin\theta\,d\theta\int_0^{2\pi}d\phi = 0$$

since the integral over $\theta$ vanishes. 

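Since the first-order perturbation vanishes, we must use second-order perturbation 
theory to calculate the change in the energy due to the external electric field. 
Equation (9.20) gives 

$$E^{(2)} = \sum_{n \ne 1,\,l,\,m_l}\frac{|\langle\psi_{100}|e\mathcal{E}z|\psi_{nlm_l}\rangle|^2}{E_1 - E_n} \tag{9.33}$$

where $E_n = -13.6\ \text{eV}/n^2$. Recall from Chapter 6 that every hydrogen wave function 
can be written as the product of a radial wave function $R_{nl}(r)$ and the spherical 
harmonic $Y_l^{m_l}(\theta, \phi)$. Then the inner product which appears in Equation (9.33) can 
be written in the form 

$$\langle\psi_{100}|e\mathcal{E}z|\psi_{nlm_l}\rangle = e\mathcal{E}\int R_{10}^*(r)\,Y_0^{0*}(\theta,\phi)\; r\cos\theta\; R_{nl}(r)\,Y_l^{m_l}(\theta,\phi)\,d^3r \tag{9.34}$$

But now recall that $Y_0^0 = 1/\sqrt{4\pi}$ and $Y_1^0 = \sqrt{3/4\pi}\,\cos\theta$, so $Y_0^0\cos\theta = (1/\sqrt{3})\,Y_1^0$. 
This allows us to write Equation (9.34) as 

$$\langle\psi_{100}|e\mathcal{E}z|\psi_{nlm_l}\rangle = \frac{e\mathcal{E}}{\sqrt{3}}\int R_{10}^*(r)R_{nl}(r)\,r^3\,dr\int Y_1^{0*}(\theta,\phi)\,Y_l^{m_l}(\theta,\phi)\,\sin\theta\,d\theta\,d\phi$$

But the $Y_l^{m_l}$'s are orthonormal, so 

$$\int Y_1^{0*}(\theta,\phi)\,Y_l^{m_l}(\theta,\phi)\,\sin\theta\,d\theta\,d\phi = \begin{cases} 1 & (l = 1,\ m_l = 0) \\ 0 & (l \ne 1 \text{ or } m_l \ne 0) \end{cases}$$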

Hence, in the sum in Equation (9.33), only the $l = 1$, $m_l = 0$ terms are nonzero, 
giving 

$$E^{(2)} = \sum_{n=2}^{\infty}\frac{\left|(e\mathcal{E}/\sqrt{3})\int R_{10}^*(r)R_{n1}(r)\,r^3\,dr\right|^2}{E_1 - E_n} \tag{9.35}$$

The integral under the sum in Equation (9.35) can be evaluated exactly for all 
values of $n$, and the terms in the series decrease rapidly with $n$: 

$$E^{(2)} = -(4\pi\epsilon_0)\,a_0^3\,\mathcal{E}^2\,(1.48 + 0.20 + 0.066 + \cdots) = -\frac{9}{4}(4\pi\epsilon_0)\,a_0^3\,\mathcal{E}^2$$


The change in energy is negative, since the hydrogen atom becomes polarized and 
aligns itself so as to partially cancel the external electric field (Figure 9.10; see 
also Exercise 9.2). Since the change in energy is proportional to the square of the 
applied electric field, this effect is called the quadratic Stark effect. 
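
The first few terms of the series can be reproduced numerically. The sketch below relies 
on a closed form for the dipole matrix element, 
$|\langle\psi_{100}|z|\psi_{n10}\rangle|^2 = 2^8 n^7 (n-1)^{2n-5} a_0^2 / [3\,(n+1)^{2n+5}]$, which is taken from 
standard references and is an assumption here rather than something derived in the text. 
Energies are measured in hartrees, so that $E_n - E_1 = \frac{1}{2}(1 - 1/n^2)$ and each term comes 
out in units of $(4\pi\epsilon_0)a_0^3\mathcal{E}^2$. The function names are chosen for this example. 

```python
import math


def dipole_squared(n):
    """|<100| z |n10>|^2 in units of a0^2 for n >= 2 (closed form assumed).

    Evaluated through logarithms so that large n does not overflow.
    """
    log_value = (
        8 * math.log(2)
        + 7 * math.log(n)
        + (2 * n - 5) * math.log(n - 1)
        - math.log(3)
        - (2 * n + 5) * math.log(n + 1)
    )
    return math.exp(log_value)


def stark_term(n):
    """Magnitude of the n-th term of the sum, in units of (4 pi eps0) a0^3 E^2."""
    gap = 0.5 * (1.0 - 1.0 / n**2)  # E_n - E_1 in hartrees
    return dipole_squared(n) / gap


if __name__ == "__main__":
    for n in (2, 3, 4):
        print(f"n = {n}: {stark_term(n):.3f}")
    discrete_sum = sum(stark_term(n) for n in range(2, 2001))
    print(f"sum over n = 2..2000: {discrete_sum:.3f}")
```

Under these assumptions the first three terms agree with the values 1.48, 0.20, and 0.066 
quoted above. The discrete terms on their own add up to somewhat less than 9/4; in the 
usual treatment the remainder is attributed to the unbound (continuum) states of 
hydrogen, a point that goes beyond the discussion here. 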

Our use of nondegenerate perturbation theory breaks down for the excited states 
of hydrogen, since these states are degenerate. Using degenerate perturbation theory, 
it is possible to show that the change in energy for these excited states is 
proportional to $\mathcal{E}$ rather than $\mathcal{E}^2$. Hence, the change in energy when an electric 
field is applied to the excited states of hydrogen is called the linear Stark effect. 







FIGURE 9.10 A classical picture of the quadratic Stark effect: the hydrogen atom is 
polarized by the external electric field, and the field produced by the polarized atom is in 
the opposite direction to the external field. 

The Atom in a Magnetic Field: The Zeeman Effect 

Now consider what happens when we apply an external magnetic field $\mathbf{B}$ to the 
hydrogen atom. Assume that the magnetic field has magnitude $B$ and is pointing 
in the $z$ direction so that 

$$\mathbf{B} = B\hat{\mathbf{z}}$$

The potential energy of a magnetic dipole $\boldsymbol{\mu}$ in a magnetic field $\mathbf{B}$ is just 
$V = -\boldsymbol{\mu}\cdot\mathbf{B}$, so the perturbation produced by the magnetic field is 

$$H_1 = -\boldsymbol{\mu}\cdot\mathbf{B} \tag{9.36}$$

FIGURE 9.11 The magnetic moment $\boldsymbol{\mu}$ of the hydrogen atom is proportional to $\mathbf{L} + 2\mathbf{S}$, 
while the total angular momentum $\mathbf{J}$ is proportional to $\mathbf{L} + \mathbf{S}$, so $\boldsymbol{\mu}$ and $\mathbf{J}$ are not, in general, 
parallel. 


In Equation (9.36), there are two contributions to the atomic magnetic moment: the 
contribution from the orbital magnetic moment $\boldsymbol{\mu}_l$ and the contribution from the 
spin magnetic moment of the electron $\boldsymbol{\mu}_s$. (In principle, we should also include the 
spin magnetic moment of the proton, but this is much smaller and can be ignored.) 
Hence, the total magnetic moment is 

$$\boldsymbol{\mu} = \boldsymbol{\mu}_l + \boldsymbol{\mu}_s = -\frac{g_l\mu_B}{\hbar}\mathbf{L} - \frac{g_s\mu_B}{\hbar}\mathbf{S}$$

Recall that $g_l = 1$ and $g_s \approx 2$, so the expression for $\boldsymbol{\mu}$ becomes 

$$\boldsymbol{\mu} = -\frac{\mu_B}{\hbar}[\mathbf{L} + 2\mathbf{S}] \tag{9.37}$$

Note that the magnetic moment of the atom is not proportional to the total angular 
momentum operator $\mathbf{J}$, which is $\mathbf{L} + \mathbf{S}$. In classical terms, the angular momentum 
vector $\mathbf{J}$ and magnetic moment vector $\boldsymbol{\mu}$ are not parallel (Figure 9.11). This has 
important consequences for the Zeeman effect. (You are already familiar with a 
much larger classical system in which the angular momentum and magnetic dipole 
are not parallel: the earth!) 






We can use Equation (9.37) to rewrite the perturbation in Equation (9.36) as 

$$H_1 = B\,\frac{\mu_B}{\hbar}\,[L_z + 2S_z]$$

Applying this perturbation to the hydrogen state $|n\,l\,m_l\,m_s\rangle$ gives the first-order 
change in energy, 

$$E^{(1)} = \langle n\,l\,m_l\,m_s|(B\mu_B/\hbar)(L_z + 2S_z)|n\,l\,m_l\,m_s\rangle = B\mu_B(m_l + 2m_s) \tag{9.38}$$

The problem with this result is that it requires the atom to be in a state of definite 
$m_l$ and $m_s$ (or, equivalently, an eigenstate of $S_z$ and $L_z$). However, we saw in our 
discussion of fine structure in Section 9.2 that the spin-orbit coupling drives the 
atom into an eigenstate of $J^2$, which does not commute with $S_z$ and $L_z$. Hence, 
the atom is in a state of definite $j$ and $m_j$ rather than $m_l$ and $m_s$, so our argument 
would appear to be invalid. 

To clarify this situation, we can write the full Hamiltonian as 

$$H = H_0 + \frac{e^2}{8\pi\epsilon_0 m_e^2 c^2 r^3}\,\mathbf{S}\cdot\mathbf{L} + B\,\frac{\mu_B}{\hbar}\,[L_z + 2S_z] \tag{9.39}$$

where the second term is the perturbation due to the spin-orbit interaction (given 
in Equation 9.23), and the third term is the perturbation from the external magnetic 
field. 

Now consider two possible cases: for very strong magnetic fields ($B \gg 1$ T), the 
third term in Equation (9.39) dominates the second term, while for weak magnetic 
fields ($B \ll 1$ T), the second term dominates the third. Consider the case of strong 
magnetic fields first. For this case we simply ignore the effect of the spin-orbit 
coupling; the strong magnetic field overwhelms the spin-orbit coupling and drives 
the atom back into a state of definite $m_l$ and $m_s$. Therefore, for the strong magnetic 
field case, the expression we derived for $E^{(1)}$ in Equation (9.38) is correct: 

$$E^{(1)} = B\mu_B(m_l + 2m_s)$$

This regime of the Zeeman effect is called the strong-field Zeeman effect or the 
Paschen-Back effect. An illustration of this perturbation in the energy levels is 
shown in Figure 9.12 for the case $l = 1$. 
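
The level pattern for a given $l$ follows directly from $E^{(1)} = B\mu_B(m_l + 2m_s)$ by 
enumerating the allowed $(m_l, m_s)$ pairs. The sketch below does this with exact fractions; 
the function name is chosen for this example, and energies are reported in units of $B\mu_B$. 

```python
from fractions import Fraction


def paschen_back_levels(l):
    """Group the (m_l, m_s) states of a given l by their shift m_l + 2 m_s.

    Returns a dict mapping the shift (in units of B * mu_B) to the list of
    (m_l, m_s) pairs that share it, ordered from lowest to highest shift.
    """
    half = Fraction(1, 2)
    levels = {}
    for m_l in range(-l, l + 1):
        for m_s in (-half, half):
            shift = m_l + 2 * m_s
            levels.setdefault(shift, []).append((m_l, m_s))
    return dict(sorted(levels.items()))


if __name__ == "__main__":
    for shift, states in paschen_back_levels(1).items():
        labels = ", ".join(f"(m_l={m_l}, m_s={m_s})" for m_l, m_s in states)
        print(f"shift = {shift} B*mu_B: {labels}")
```

For $l = 1$ the six states fall into five distinct levels with $m_l + 2m_s = -2, -1, 0, 1, 2$; 
the middle level is shared by the two states $(m_l = -1, m_s = 1/2)$ and $(m_l = 1, m_s = -1/2)$. 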

FIGURE 9.12 The strong-field Zeeman effect for the energy levels of an $l = 1$ state in 
hydrogen. 

Now consider the opposite regime in which spin-orbit coupling dominates the 
effect of the external magnetic field. In this case the atom is in a state of definite 
$j$ and $m_j$ rather than $m_l$ and $m_s$, and the perturbation must be written as 

$$\begin{aligned}
E^{(1)} &= \langle n\,l\,j\,m_j|(B\mu_B/\hbar)(L_z + 2S_z)|n\,l\,j\,m_j\rangle \\
&= \langle n\,l\,j\,m_j|(B\mu_B/\hbar)(J_z + S_z)|n\,l\,j\,m_j\rangle \\
&= B\mu_B m_j + \frac{B\mu_B}{\hbar}\langle n\,l\,j\,m_j|S_z|n\,l\,j\,m_j\rangle
\end{aligned} \tag{9.40}$$

In order to further simplify this expression, the state $|n\,l\,j\,m_j\rangle$ must be written as a 
linear combination of the $|n\,l\,m_l\,m_s\rangle$ states. From Chapter 8, we know that $s = 1/2$ 
and a given value of $l$ can couple to give either $j = l + 1/2$ or $j = l - 1/2$, while 
$m_j = m_l + m_s$. The actual linear combination is 

$$|j = l + 1/2,\ m_j\rangle = \left(\frac{l + 1/2 + m_j}{2l + 1}\right)^{1/2}|m_l = m_j - 1/2,\ m_s = 1/2\rangle + \left(\frac{l + 1/2 - m_j}{2l + 1}\right)^{1/2}|m_l = m_j + 1/2,\ m_s = -1/2\rangle$$

$$|j = l - 1/2,\ m_j\rangle = -\left(\frac{l + 1/2 - m_j}{2l + 1}\right)^{1/2}|m_l = m_j - 1/2,\ m_s = 1/2\rangle + \left(\frac{l + 1/2 + m_j}{2l + 1}\right)^{1/2}|m_l = m_j + 1/2,\ m_s = -1/2\rangle$$

We can use these equations to solve for $\langle n\,l\,j\,m_j|S_z|n\,l\,j\,m_j\rangle$. For $j = l + 1/2$, 
we get 

$$\begin{aligned}
\langle n\,l\,j\,m_j|S_z|n\,l\,j\,m_j\rangle &= \left[\left(\frac{l + 1/2 + m_j}{2l + 1}\right)^{1/2}\langle m_l = m_j - 1/2,\ m_s = 1/2| + \left(\frac{l + 1/2 - m_j}{2l + 1}\right)^{1/2}\langle m_l = m_j + 1/2,\ m_s = -1/2|\right] \\
&\quad \times S_z\left[\left(\frac{l + 1/2 + m_j}{2l + 1}\right)^{1/2}|m_l = m_j - 1/2,\ m_s = 1/2\rangle + \left(\frac{l + 1/2 - m_j}{2l + 1}\right)^{1/2}|m_l = m_j + 1/2,\ m_s = -1/2\rangle\right] \\
&= \frac{\hbar}{2}\left(\frac{l + 1/2 + m_j}{2l + 1}\right) - \frac{\hbar}{2}\left(\frac{l + 1/2 - m_j}{2l + 1}\right) \\
&= \frac{m_j\hbar}{2l + 1}
\end{aligned}$$

Similarly, for $j = l - 1/2$, we obtain 

$$\langle n\,l\,j\,m_j|S_z|n\,l\,j\,m_j\rangle = -\frac{m_j\hbar}{2l + 1}$$

Combining the results for $j = l + 1/2$ and $j = l - 1/2$, we get 

$$\langle n\,l\,j\,m_j|S_z|n\,l\,j\,m_j\rangle = \frac{m_j\hbar}{2l + 1}\,2(j - l)$$


and substituting this result into Equation (9.40) yields 

$$E^{(1)} = B\mu_B m_j\left[1 + \frac{2(j - l)}{2l + 1}\right]$$

In analogy with the $g_s$ factor for the electron spin and $g_l$ for the orbital angular 
momentum, we can write 

$$g = 1 + \frac{2(j - l)}{2l + 1}$$

where this $g$ is called the Landé $g$ factor. In terms of the Landé $g$ factor, the energy 
shift becomes 

$$E^{(1)} = g B\mu_B m_j$$

In contrast to $g_s$ and $g_l$, which are constant, the Landé $g$ factor is not constant, 
but rather is a function of $j$ and $l$. The reason for this is the fact, already alluded 
to, that $\boldsymbol{\mu}$ is not parallel to $\mathbf{J}$, since the operator which determines $\boldsymbol{\mu}$ is $\mathbf{L} + 2\mathbf{S}$ 
while $\mathbf{J} = \mathbf{L} + \mathbf{S}$. Hence, the ratio between $\boldsymbol{\mu}$ and $\mathbf{J}$ can depend on the relative 
orientation of $\mathbf{L}$ and $\mathbf{S}$ (Figure 9.11), so $E^{(1)}$ is not a fixed multiple of $m_j$. The effect 
of the weak-field Zeeman effect is to split the energies of the individual $m_j$ levels, 
with a magnitude which depends on both the magnitude of the magnetic field and 
the value of the Landé $g$ factor (Figure 9.13). 

FIGURE 9.13 The splitting of the $P_{1/2}$ and $P_{3/2}$ states in the weak-field Zeeman effect. 

To summarize, for weak magnetic fields, the hydrogen atom can be taken to 
be in a state of definite $j$ and $m_j$, and the magnetic field separates the energies of 
the individual states. As the magnetic field is increased, it eventually becomes 
stronger than the internal magnetic fields of the atom. In this limit, the magnetic 
field drives the hydrogen atom into a state of definite $m_l$ and $m_s$, and the perturbation 
in energy is just proportional to $m_l + 2m_s$. 
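
The weak-field results can be cross-checked with exact rational arithmetic. The sketch 
below computes $\langle S_z\rangle$ from the squared coefficients of the linear combinations used in 
the derivation, evaluates $g = 1 + 2(j - l)/(2l + 1)$, and compares it with the general Landé 
formula $g = 1 + [j(j+1) + s(s+1) - l(l+1)]/[2j(j+1)]$ found in standard references. That 
general formula (with $g_s = 2$) and all function names are assumptions of this example. 

```python
from fractions import Fraction

HALF = Fraction(1, 2)


def m_j_values(j):
    """All allowed m_j = -j, -j + 1, ..., j as exact fractions."""
    return [-j + k for k in range(int(2 * j) + 1)]


def sz_expectation(l, j, m_j):
    """<S_z> in units of hbar, from the squared expansion coefficients."""
    weight_plus = (l + HALF + m_j) / (2 * l + 1)
    weight_minus = (l + HALF - m_j) / (2 * l + 1)
    if j == l + HALF:
        # spin-up carries weight_plus, spin-down carries weight_minus
        return HALF * weight_plus - HALF * weight_minus
    if j == l - HALF:
        # for j = l - 1/2 the two weights are interchanged
        return HALF * weight_minus - HALF * weight_plus
    raise ValueError("j must be l + 1/2 or l - 1/2")


def lande_g(l, j):
    """Lande g factor for spin 1/2: g = 1 + 2 (j - l) / (2 l + 1)."""
    return 1 + 2 * (j - l) / (2 * l + 1)


def lande_g_general(l, j, s=HALF):
    """General Lande formula with g_s = 2 (assumed from standard references)."""
    return 1 + (j * (j + 1) + s * (s + 1) - l * (l + 1)) / (2 * j * (j + 1))


if __name__ == "__main__":
    for label, l, j in (("S_1/2", 0, HALF), ("P_1/2", 1, HALF),
                        ("P_3/2", 1, 3 * HALF), ("D_3/2", 2, 3 * HALF),
                        ("D_5/2", 2, 5 * HALF)):
        shifts = [str(lande_g(l, j) * m) for m in m_j_values(j)]
        print(f"{label}: g = {lande_g(l, j)}, shifts / (B mu_B) = {shifts}")
```

For the P states this gives $g = 2/3$ for $P_{1/2}$ and $g = 4/3$ for $P_{3/2}$, so the $m_j$ levels of 
the two multiplets are spaced differently, which is the pattern sketched in Figure 9.13. 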


EXERCISES 

9.1 (a) In Example 9.2, the energy of the system can be calculated exactly. Take $\mathbf{B} = 
B_x\hat{\mathbf{x}} + B_z\hat{\mathbf{z}}$, and calculate the exact energies. [Hint: Feel free to use a different 
coordinate system; the energy levels cannot depend on the choice of the coordinate 
system.] 

(b) Take the answer in part (a) and expand it out in powers of $B_x$, remembering 
that $B_x \ll B_z$. Show that the terms proportional to $B_x$ and $B_x^2$ correspond to the 
answers derived in Example 9.2. 





9.2 A particle is in a potential $V_0$ in its ground state $|\psi_0\rangle$. A small perturbation $H_1$ is 
applied to the particle. Suppose that the first-order perturbation to the energy is zero: 
$E^{(1)} = \langle\psi_0|H_1|\psi_0\rangle = 0$. Show that the lowest-order effect of $H_1$ is to decrease the 
energy of the ground state. 

9.3 A particle of mass $m$ is confined to move in a one-dimensional square well with infinite 
potential barriers at $x = 0$ and $x = a$, with $V = 0$ for $0 < x < a$. The particle is in the 
ground state. A perturbation $H_1 = \alpha\,\delta(x - a/2)$ is added, where $\alpha$ is a small constant. 

(a) What units does $\alpha$ have? 

(b) Calculate the first-order perturbation $E^{(1)}$ due to $H_1$. 

(c) Calculate the second-order perturbation $E^{(2)}$. The answer may be expressed as 
an infinite series. 

9.4 A particle of mass m is confined to move in a narrow, straight tube of length a which 
is sealed at both ends with V = 0 inside the tube. Treat the tube as a one-dimensional 
infinite square well. The tube is placed at an angle $\theta$ relative to the surface of the 
earth. The particle experiences the usual gravitational potential V = mgh. Calculate 
the lowest-order change in the energy of the ground state due to the gravitational 
potential. 

9.5 A particle of mass $m$ is in the ground state in the harmonic oscillator potential 

$$V(x) = \tfrac{1}{2}Kx^2$$

A small perturbation $\beta x^6$ is added to this potential. 

(a) What are the units of $\beta$? 

(b) How small must $\beta$ be in order for perturbation theory to be valid? 

(c) Calculate the first-order change in the energy of the particle. 

9.6 In the hydrogen atom, the proton is not really a point charge but has a finite size. 
Assume that the proton behaves as a uniformly charged sphere of radius $R = 10^{-15}$ m. 
Calculate the shift this produces in the ground-state energy of hydrogen. 

9.7 The photon is normally assumed to have zero rest mass. If the photon had a small mass, 
this would alter the potential energy which the electron experiences in the electric field 
of the proton. Instead of 






9.8 Suppose that the proton had spin 0 instead of spin 1/2. 

(a) How would this alter the fine structure of the energy levels of the hydrogen atom? 

(b) How would this alter the hyperfine structure of the energy levels of the hydrogen 
atom? 

9.9 We have seen that the spin-orbit interaction splits the $l \ne 0$ states in the hydrogen 
atom into $j = l + 1/2$ states (with slightly higher energy) and $j = l - 1/2$ states 
(with slightly lower energy). Suppose that the electron had spin 1. How many different 
energy levels would the spin-orbit interaction produce, and what would their relative 
energies be? Be sure to consider how the answer would depend on the value of $l$. 

9.10 Equation (9.29) gives the fine-structure energy shift. 

(a) Show that the $j = l + 1/2$ state has a higher energy than the $j = l - 1/2$ state. 

(b) Show that the change in energy, $E^{(1)}_{\text{fine structure}}$, is always negative. 

9.11 An electron is in the ground state in a three-dimensional rectangular box given by 
$0 < x < a$, $0 < y < b$, and $0 < z < c$, where $V = 0$ inside the box, and there are 
infinite potential barriers at all of the walls. A homogeneous, static electric field with 
magnitude $\mathcal{E}$ is applied in the $x$ direction. What is the lowest-order change in the 
energy of the electron? 

9.12 A hydrogen atom in its ground state is placed in a homogeneous, static electric field 
with magnitude $\mathcal{E}$ in the $x$ direction. 

(a) Show that the first-order perturbation $E^{(1)}$ is 0. 

(b) Show that the second-order perturbation $E^{(2)}$ is the same as if the field was 
pointing in the $z$ direction. [This is obvious from symmetry, but calculate $E^{(2)}$ 
using perturbation theory and show it explicitly.] 

9.13 A hydrogen atom is in its ground state. A proton is fixed in space a distance $R$ from 
the nucleus of the hydrogen atom, where $R \gg a_0$. Calculate the perturbation to the 
energy of the hydrogen atom due to the electric field of this proton. 

9.14 The electron in a hydrogen atom is in a D state. A homogeneous, static magnetic field 
is applied in the $z$ direction. 

(a) Draw a diagram showing the splitting of the energy levels in the weak-field limit. 
Calculate the value of g for each energy level. 

(b) Draw a diagram showing the splitting of the energy levels in the strong-field limit. 

9.15 (a) A particle is in a state $|\psi\rangle$ which is an eigenfunction of the Hamiltonian $H_0$ with 
energy $E$. A perturbation $H_1$ is applied such that $H_1|\psi\rangle = 0$. Show that the energy 
of the system is completely unchanged by this perturbation. 

(b) In the ground state of the helium atom, both electrons are in the $l = 0$ state, and 
the spin wave function for the two electrons is the singlet spin state ($S = 0$ and 
$m_s = 0$). [This is a consequence of the Pauli exclusion principle, which will be 
discussed in Chapter 13.] A homogeneous, static magnetic field is applied in the 
$z$ direction. Show that the energy of the ground state of helium is completely 
unaffected by this magnetic field. [Ignore the magnetic moment of the nucleus.] 
What is the physical reason for this? 




CHAPTER 

10 


The Variational Principle 


In the previous chapter, we began with systems for which the Schrodinger equation 
could be solved exactly, and calculated the change in energy when a small, 
time-independent perturbation was added to the Hamiltonian. The next logical step 
is to examine time-dependent perturbations. However, before doing so, we will 
take a slight detour and develop a technique called the variational principle. The 
variational principle applies to time-independent systems, but it is not a form of 
perturbation theory; i.e., it does not assume that a small perturbation is applied to a 
known exact solution. Instead, the variational principle is a technique for estimating 
the ground-state energy of an arbitrary Hamiltonian for which the Schrodinger 
equation cannot be solved at all. For example, the Schrodinger equation cannot be 
solved exactly for atoms with more than one electron, but the variational principle 
provides a tool to estimate the ground-state energies for such atoms. 

The variational principle is based on a simple idea: the expectation value of 
the Hamiltonian calculated for an arbitrary wave function gives an upper bound 
on the ground-state energy. By “varying” the wave function used to calculate this 
expectation value and picking out the smallest resulting value for the expectation 
value, we obtain an estimate for the ground-state energy. We will derive this result 
below and then apply it to two different examples. 


10.1 ■ VARIATIONAL PRINCIPLE: THEORY 

Suppose we have a Hamiltonian $H$ with a set of energies and eigenfunctions: 

$$\begin{aligned}
H|\psi_0\rangle &= E_0|\psi_0\rangle \\
H|\psi_1\rangle &= E_1|\psi_1\rangle \\
&\ \,\vdots \\
H|\psi_n\rangle &= E_n|\psi_n\rangle
\end{aligned}$$

Assume further that we cannot necessarily solve for any of the energies or 
eigenfunctions explicitly. Now suppose that we choose a completely arbitrary wave 
function $|\psi\rangle$, which need not be normalized, and we calculate the expectation 
value of $H$, $\langle\psi|H|\psi\rangle/\langle\psi|\psi\rangle$. (The $\langle\psi|\psi\rangle$ in the denominator is necessary 
because we have not assumed that $|\psi\rangle$ is normalized.) The variational principle is 
based on the fact that this expectation value gives an upper bound on the 
ground-state energy $E_0$: 

$$\frac{\langle\psi|H|\psi\rangle}{\langle\psi|\psi\rangle} \ge E_0 \tag{10.1}$$

First we will prove the result in Equation (10.1), and then explore the consequences. 

Recall that the eigenfunctions of $H$ can be chosen to be an orthonormal basis 
set, so the arbitrary wave function $|\psi\rangle$ that appears in Equation (10.1) can be 
expanded as a linear combination of these eigenfunctions: 

$$|\psi\rangle = c_0|\psi_0\rangle + c_1|\psi_1\rangle + \cdots + c_n|\psi_n\rangle + \cdots$$

Substituting this expansion into the numerator on the left-hand side of Equation 
(10.1) gives 

$$\langle\psi|H|\psi\rangle = (c_0^*\langle\psi_0| + c_1^*\langle\psi_1| + \cdots + c_n^*\langle\psi_n| + \cdots)\,H\,(c_0|\psi_0\rangle + c_1|\psi_1\rangle + \cdots + c_n|\psi_n\rangle + \cdots)$$

But $H$ simply pulls out the appropriate energy when it operates on each 
eigenfunction, so 

$$\langle\psi|H|\psi\rangle = (c_0^*\langle\psi_0| + c_1^*\langle\psi_1| + \cdots + c_n^*\langle\psi_n| + \cdots)(c_0E_0|\psi_0\rangle + c_1E_1|\psi_1\rangle + \cdots + c_nE_n|\psi_n\rangle + \cdots) \tag{10.2}$$

Because the eigenfunctions are orthonormal, $\langle\psi_0|\psi_0\rangle = 1$, $\langle\psi_1|\psi_1\rangle = 1$, ..., 
$\langle\psi_n|\psi_n\rangle = 1$, ..., and all of the terms containing $\langle\psi_m|\psi_n\rangle$ with $m \ne n$ are zero. 
Therefore, Equation (10.2) simplifies to 

$$\langle\psi|H|\psi\rangle = |c_0|^2E_0 + |c_1|^2E_1 + \cdots + |c_n|^2E_n + \cdots$$

Similarly, the denominator in Equation (10.1) can be expressed as 

$$\begin{aligned}
\langle\psi|\psi\rangle &= (c_0^*\langle\psi_0| + c_1^*\langle\psi_1| + \cdots + c_n^*\langle\psi_n| + \cdots)(c_0|\psi_0\rangle + c_1|\psi_1\rangle + \cdots + c_n|\psi_n\rangle + \cdots) \\
&= |c_0|^2 + |c_1|^2 + \cdots + |c_n|^2 + \cdots
\end{aligned}$$

so 

$$\frac{\langle\psi|H|\psi\rangle}{\langle\psi|\psi\rangle} = \frac{|c_0|^2E_0 + |c_1|^2E_1 + \cdots + |c_n|^2E_n + \cdots}{|c_0|^2 + |c_1|^2 + \cdots + |c_n|^2 + \cdots} \tag{10.3}$$

Because $E_0$ is the ground-state energy, it must be true that 

$$E_0 \le E_1 \le \cdots \le E_n \le \cdots$$

so the numerator in Equation (10.3) satisfies 

$$|c_0|^2E_0 + |c_1|^2E_1 + \cdots + |c_n|^2E_n + \cdots \ge |c_0|^2E_0 + |c_1|^2E_0 + \cdots + |c_n|^2E_0 + \cdots$$

FIGURE 10.1 Because $\langle\psi(\alpha)|H|\psi(\alpha)\rangle/\langle\psi(\alpha)|\psi(\alpha)\rangle \ge E_0$, the best estimate for $E_0$ is 
obtained by minimizing $\langle\psi(\alpha)|H|\psi(\alpha)\rangle/\langle\psi(\alpha)|\psi(\alpha)\rangle$. 

Substituting this bound into Equation (10.3) yields 

$$\frac{\langle\psi|H|\psi\rangle}{\langle\psi|\psi\rangle} \ge \frac{|c_0|^2E_0 + |c_1|^2E_0 + \cdots + |c_n|^2E_0 + \cdots}{|c_0|^2 + |c_1|^2 + \cdots + |c_n|^2 + \cdots}$$

which reduces to Equation (10.1). 
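
The content of Equation (10.3) can be illustrated numerically: for any set of expansion 
coefficients $c_n$, the weighted average of the energies cannot fall below $E_0$. The sketch 
below draws random complex coefficients for a made-up four-level spectrum; the energies, 
function names, and random seed are all hypothetical choices for this example. 

```python
import random


def energy_ratio(energies, coeffs):
    """Right-hand side of Equation (10.3): sum |c_n|^2 E_n / sum |c_n|^2."""
    numerator = sum(abs(c) ** 2 * e for c, e in zip(coeffs, energies))
    denominator = sum(abs(c) ** 2 for c in coeffs)
    return numerator / denominator


def random_coefficients(count, rng):
    """Random complex expansion coefficients; they need not be normalized."""
    return [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(count)]


if __name__ == "__main__":
    energies = [1.0, 2.5, 4.0, 7.0]  # hypothetical spectrum with E_0 = 1.0
    rng = random.Random(0)
    ratios = [energy_ratio(energies, random_coefficients(len(energies), rng))
              for _ in range(1000)]
    print(f"smallest ratio over 1000 random trials: {min(ratios):.4f}")
    print(f"ratio for the pure ground state: {energy_ratio(energies, [1, 0, 0, 0])}")
```

Every random trial gives a ratio at or above $E_0$, and the bound is reached only when the 
trial state has no admixture of the excited states. 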

Equation (10.1) says that $\langle\psi|H|\psi\rangle/\langle\psi|\psi\rangle$ gives an upper bound on the 
ground-state energy, but how can it be used to estimate the ground-state energy for the 
Hamiltonian $H$? One possibility would be to simply substitute a large number of 
different wave functions $|\psi\rangle$ into the left-hand side of Equation (10.1) and pick the 
one which gives the smallest answer. Since we are guaranteed that this quantity 
is an upper bound on $E_0$, the smallest answer will give the best approximation 
to $E_0$. There is, however, a method to sample an infinite number of trial wave 
functions. We can write the trial wave function $|\psi\rangle$ as a function of some continuous 
parameter $\alpha$, which we can vary. Then the quantity $\langle\psi(\alpha)|H|\psi(\alpha)\rangle/\langle\psi(\alpha)|\psi(\alpha)\rangle$ 
is a continuous function of $\alpha$, and it is guaranteed to be no smaller than $E_0$. By choosing 
$\alpha$ so as to minimize $\langle\psi(\alpha)|H|\psi(\alpha)\rangle/\langle\psi(\alpha)|\psi(\alpha)\rangle$, we obtain the best estimate 
of $E_0$ (Figure 10.1). What happens if we get lucky and accidentally choose a form 
for $|\psi(\alpha)\rangle$ for which a value of $\alpha$ exists that makes $|\psi(\alpha)\rangle$ exactly equal to the 
ground-state wave function $|\psi_0\rangle$? In this case it is easy to see (Exercise 10.1) that 
the estimate given by Equation (10.1) is exactly equal to the ground-state energy. 







Example 10.1. The Bouncing Ball. 

Consider a particle subject to the linear potential V (x) = mgx but with an infinite 
potential barrier at x = 0 (Figure 10.2). Estimate the ground-state energy using 
the variational principle. 

First we need to choose a trial wave function $\psi(x)$. There is no “correct” choice 
for $\psi(x)$, but the more closely $\psi(x)$ can be made to resemble the true ground-state 
solution of the Schrodinger equation with $V(x) = mgx$, the more accurate 
our variational estimate of the ground-state energy will be. We first note that the 
infinite potential barrier at $x = 0$ will give $\psi(0) = 0$ for the true ground-state 
wave function, so our variational wave function should also have this property. 
Further, since we are dealing with a bound state, we expect the true wave function 
to satisfy $\psi(x) \to 0$ as $x \to \infty$. Finally, our experience with solutions of the 
one-dimensional Schrodinger equation (Chapter 4) indicates that the true ground-state 
wave function will not cross the $x$-axis for $x > 0$. There are still an infinite number 
of functions with these desired properties, so our final choice will be somewhat 
arbitrary. We will use the simple trial wave function 

$$\psi(x) = xe^{-\alpha x}$$

Note that this wave function increases from $\psi = 0$ at $x = 0$ up to a maximum at 
$x = 1/\alpha$, and then $\psi$ decreases exponentially to 0 as $x \to \infty$. 






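The next step is to calculate $\langle\psi|H|\psi\rangle/\langle\psi|\psi\rangle$. The numerator is 

$$\begin{aligned}
\langle\psi|H|\psi\rangle &= \int_0^\infty (xe^{-\alpha x})\left[-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2}(xe^{-\alpha x}) + mgx\,(xe^{-\alpha x})\right]dx \\
&= \int_0^\infty\left[mgx^3 - \frac{\hbar^2}{2m}\alpha^2x^2 + \frac{\hbar^2}{m}\alpha x\right]e^{-2\alpha x}\,dx \\
&= \frac{3mg}{8\alpha^4} + \frac{\hbar^2}{8m\alpha}
\end{aligned}$$

and the denominator is 

$$\langle\psi|\psi\rangle = \int_0^\infty x^2e^{-2\alpha x}\,dx = \frac{1}{4\alpha^3}$$

so the total expression to be minimized is 

$$\frac{\langle\psi|H|\psi\rangle}{\langle\psi|\psi\rangle} = \frac{3mg}{2\alpha} + \frac{\hbar^2}{2m}\alpha^2$$

Taking the derivative of the right-hand side and setting it to zero gives 

$$-\frac{3mg}{2\alpha^2} + \frac{\hbar^2}{m}\alpha = 0 \tag{10.4}$$

with the solution 

$$\alpha = \left(\frac{3m^2g}{2\hbar^2}\right)^{1/3}$$

Now this value for $\alpha$ must be substituted back into the expression for 
$\langle\psi|H|\psi\rangle/\langle\psi|\psi\rangle$ to give the estimate for the ground-state energy. We obtain 

$$E_{\text{estimated}} = \left(\frac{3}{2}\right)^{5/3}(mg^2\hbar^2)^{1/3}$$

To see how our estimate depends on the actual choice of the trial wave function, 
see Exercise 10.5. 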
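
The minimization in this example can also be carried out numerically, which is a useful 
check on the algebra. The sketch below works in units with $m = g = \hbar = 1$ and minimizes 
$3mg/(2\alpha) + \hbar^2\alpha^2/(2m)$ with a simple golden-section search. For comparison it uses 
the exact ground-state energy $a_1\,(mg^2\hbar^2/2)^{1/3}$, where $a_1 \approx 2.33811$ is the magnitude of 
the first zero of the Airy function; that exact result is quoted from standard references 
and is not derived in this example. The function names and search interval are choices 
made for this sketch. 

```python
def energy_estimate(alpha, m=1.0, g=1.0, hbar=1.0):
    """<psi|H|psi>/<psi|psi> for the trial function x * exp(-alpha * x)."""
    return 3.0 * m * g / (2.0 * alpha) + hbar**2 * alpha**2 / (2.0 * m)


def minimize_golden(f, lo, hi, tol=1e-10):
    """Golden-section search for the minimum of a unimodal function on [lo, hi]."""
    inv_phi = (5 ** 0.5 - 1) / 2
    a, b = lo, hi
    while b - a > tol:
        c = b - inv_phi * (b - a)
        d = a + inv_phi * (b - a)
        if f(c) < f(d):
            b = d   # minimum lies in [a, d]
        else:
            a = c   # minimum lies in [c, b]
    return (a + b) / 2


AIRY_ZERO = 2.33810741                      # magnitude of the first Airy zero
EXACT_ENERGY = AIRY_ZERO * 0.5 ** (1 / 3)   # exact E_0 in units of (m g^2 hbar^2)^(1/3)

if __name__ == "__main__":
    best_alpha = minimize_golden(energy_estimate, 0.1, 10.0)
    estimate = energy_estimate(best_alpha)
    print(f"best alpha          = {best_alpha:.6f}")
    print(f"variational energy  = {estimate:.6f}")
    print(f"exact energy        = {EXACT_ENERGY:.6f}")
    print(f"relative difference = {(estimate / EXACT_ENERGY - 1) * 100:.1f} %")
```

In these units the search reproduces $\alpha = (3/2)^{1/3}$ and $E_{\text{estimated}} = (3/2)^{5/3}$, which lies 
a few percent above the exact value, as an upper bound should. 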


The variational principle, therefore, is a three-stage process: first, choose a 
trial wave function that depends on a parameter $\alpha$; second, vary $\alpha$ so as to 
minimize $\langle\psi|H|\psi\rangle/\langle\psi|\psi\rangle$; and third, substitute the resulting wave function back into 
$\langle\psi|H|\psi\rangle/\langle\psi|\psi\rangle$ to obtain the estimate of the ground-state energy $E_0$. It is 
guaranteed that the answer will provide an upper bound on $E_0$, and it may provide 
an excellent approximation to $E_0$, depending on how closely the trial wave 
function can be made to resemble the true ground-state wave function. Some common 
mistakes in using the variational principle are forgetting to include the kinetic 
energy term in $H$ when evaluating $\langle\psi|H|\psi\rangle$, forgetting to divide by $\langle\psi|\psi\rangle$, and 
forgetting to complete the solution by substituting $\alpha$ back into the expression for 
$\langle\psi|H|\psi\rangle/\langle\psi|\psi\rangle$. 

There is no “correct” choice for the trial wave function, but the form of the 
potential will often provide a guide as to a reasonable choice. As noted, the goal 
is to pick something which can be made to resemble the true ground-state wave 
function as closely as possible. It is also possible to use the variational principle 
to estimate the first excited-state energy (see Exercise 10.2 and Exercise 10.6). 

10.2 ■ VARIATIONAL PRINCIPLE: APPLICATION TO THE HELIUM ATOM 

As we have seen, quantum mechanics provides an excellent description of the
hydrogen atom. The energy levels and wave functions can be calculated via the
Schrödinger equation, and perturbation theory can predict the small changes to
these energy levels due to magnetic interactions in the atom. Now it is time to
reveal an unpleasant fact: the Schrödinger equation cannot be solved in a simple
way for any of the other atoms! Even adding a single electron, to produce a helium
atom, yields an intractable Schrödinger equation. On the other hand, the calculation
of the ground-state energy of helium is the “classic” problem for the variational
principle.

The helium atom contains a nucleus of charge $+2e$ and two negatively charged
electrons (Figure 10.3). Labelling the electrons “1” and “2”, the Hamiltonian for
the helium atom is

$$H = \frac{P_1^2}{2m_e} + \frac{P_2^2}{2m_e} - \frac{2e^2}{4\pi\epsilon_0 r_1} - \frac{2e^2}{4\pi\epsilon_0 r_2} + \frac{e^2}{4\pi\epsilon_0|\mathbf{r}_1 - \mathbf{r}_2|} \tag{10.5}$$

where, for simplicity, we use the electron mass $m_e$ rather than the reduced mass.
In this equation, $\mathbf{r}_1$ and $\mathbf{r}_2$ are the positions of the two electrons relative to the
nucleus with corresponding radial components $r_1$ and $r_2$. The operators $\mathbf{P}_1$ and
$\mathbf{P}_2$ are the momentum operators for the two electrons, given by derivatives with
respect to the coordinate of the appropriate electron, e.g., $\mathbf{P}_1 = -i\hbar\nabla_1$, where
$\nabla_1 = \hat{x}(\partial/\partial x_1) + \hat{y}(\partial/\partial y_1) + \hat{z}(\partial/\partial z_1)$, etc. Then the physical meaning of the
various terms in the Hamiltonian is clear: the first and second terms give the
kinetic energy of the two electrons, the third and fourth terms are the potentials
that each electron feels in the Coulomb field of the nucleus, and the last term gives
the change in the energy due to the mutual repulsion of the electrons. It is this last
term which causes all the trouble in trying to find an exact solution.

FIGURE 10.3 The helium atom contains two electrons at positions $\mathbf{r}_1$ and $\mathbf{r}_2$ relative to
the nucleus.

Instead of looking for an exact solution, we will use the variational principle
to estimate the ground-state energy. First, consider an arbitrary nucleus of charge
$Ze$ with only a single electron orbiting it. In this case the Schrödinger equation
can be solved exactly, just as in the case for hydrogen, but now the charge $e$ for
the nucleus must be replaced by the charge $Ze$. Recall from Chapter 6 that the
ground-state wave function for hydrogen is

$$\psi(r) = \frac{1}{\sqrt{\pi}}\left(\frac{1}{a_0}\right)^{3/2} e^{-r/a_0}$$

so the ground-state wave function for a single-electron atom with charge $Ze$ on the
nucleus is

$$\psi(r) = \frac{1}{\sqrt{\pi}}\left(\frac{Z}{a_0}\right)^{3/2} e^{-Zr/a_0} \tag{10.6}$$

Note that the wave function for helium should be a single function of the positions
of the two electrons, not two separate wave functions for the two electrons. We
will take, as our trial wave function, the product of two wave functions of the form
given by Equation (10.6):

$$\psi(\mathbf{r}_1, \mathbf{r}_2) = \left[\frac{1}{\sqrt{\pi}}\left(\frac{Z}{a_0}\right)^{3/2} e^{-Zr_1/a_0}\right]\left[\frac{1}{\sqrt{\pi}}\left(\frac{Z}{a_0}\right)^{3/2} e^{-Zr_2/a_0}\right] \tag{10.7}$$

Rather than setting Z = 2 for the charge on the helium nucleus, we will take Z to 
be our variational free parameter. This choice is reasonable from a physical point 
of view: each electron partly cancels the positive charge that the other electron 
feels from the nucleus, so the "effective" value of Z should lie between 1 and 2. 

Equation (10.7) simplifies to

$$\psi(\mathbf{r}_1, \mathbf{r}_2) = \frac{1}{\pi}\left(\frac{Z}{a_0}\right)^3 e^{-Z(r_1+r_2)/a_0}$$

Note that this wave function is already normalized, i.e., $\langle\psi|\psi\rangle = 1$ for any value
of $Z$. Then the quantity to be minimized is


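$$\langle\psi|H|\psi\rangle = \langle\psi|\frac{P_1^2}{2m_e}|\psi\rangle + \langle\psi|\frac{P_2^2}{2m_e}|\psi\rangle - \langle\psi|\frac{2e^2}{4\pi\epsilon_0 r_1}|\psi\rangle - \langle\psi|\frac{2e^2}{4\pi\epsilon_0 r_2}|\psi\rangle + \langle\psi|\frac{e^2}{4\pi\epsilon_0|\mathbf{r}_1 - \mathbf{r}_2|}|\psi\rangle$$

By symmetry,

$$\langle\psi|\frac{P_1^2}{2m_e}|\psi\rangle = \langle\psi|\frac{P_2^2}{2m_e}|\psi\rangle$$

and

$$\langle\psi|\frac{2e^2}{4\pi\epsilon_0 r_1}|\psi\rangle = \langle\psi|\frac{2e^2}{4\pi\epsilon_0 r_2}|\psi\rangle$$

Using these results, we get

$$\langle\psi|H|\psi\rangle = 2\langle\psi|\frac{P_1^2}{2m_e}|\psi\rangle - 2\langle\psi|\frac{2e^2}{4\pi\epsilon_0 r_1}|\psi\rangle + \langle\psi|\frac{e^2}{4\pi\epsilon_0|\mathbf{r}_1 - \mathbf{r}_2|}|\psi\rangle \tag{10.8}$$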


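The integrals in the first two terms on the right-hand side of Equation (10.8) are
straightforward; they yield

$$2\langle\psi|\frac{P_1^2}{2m_e}|\psi\rangle = \frac{e^2}{4\pi\epsilon_0 a_0}Z^2 \tag{10.9}$$

and

$$-2\langle\psi|\frac{2e^2}{4\pi\epsilon_0 r_1}|\psi\rangle = \frac{e^2}{4\pi\epsilon_0 a_0}(-4Z) \tag{10.10}$$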


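The third term on the right-hand side of Equation (10.8) is not so easy to evaluate,
so we derive it in more detail. Written as an integral, this term is

$$\begin{aligned}
\langle\psi|\frac{e^2}{4\pi\epsilon_0|\mathbf{r}_1 - \mathbf{r}_2|}|\psi\rangle
&= \frac{e^2}{4\pi\epsilon_0}\int\!\!\int \left[\frac{1}{\pi}\left(\frac{Z}{a_0}\right)^3 e^{-Z(r_1+r_2)/a_0}\right]\frac{1}{|\mathbf{r}_1 - \mathbf{r}_2|}\left[\frac{1}{\pi}\left(\frac{Z}{a_0}\right)^3 e^{-Z(r_1+r_2)/a_0}\right] d^3r_1\,d^3r_2 \\
&= \frac{e^2}{4\pi^3\epsilon_0}\left(\frac{Z}{a_0}\right)^6 \int\!\!\int \frac{e^{-2Z(r_1+r_2)/a_0}}{\sqrt{(\mathbf{r}_1 - \mathbf{r}_2)\cdot(\mathbf{r}_1 - \mathbf{r}_2)}}\,d^3r_1\,d^3r_2 \\
&= \frac{e^2}{4\pi^3\epsilon_0}\left(\frac{Z}{a_0}\right)^6 \int\!\!\int \frac{e^{-2Z(r_1+r_2)/a_0}}{\sqrt{r_1^2 + r_2^2 - 2r_1 r_2\cos\theta_{12}}}\,d^3r_1\,d^3r_2
\end{aligned}$$

where $\theta_{12}$ is the angle between the vectors $\mathbf{r}_1$ and $\mathbf{r}_2$. We perform the integral over
$\mathbf{r}_2$ first. Since $\mathbf{r}_1$ is treated as a constant for the integration over $\mathbf{r}_2$, we can choose
a spherical coordinate system in which $\mathbf{r}_1$ lies on the polar axis. Then $\theta_{12}$ gives the
angle between $\mathbf{r}_2$ and the polar axis: it is just $\theta_2$ in the integration over $\mathbf{r}_2$ in polar
coordinates. We get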




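$$\langle\psi|\frac{e^2}{4\pi\epsilon_0|\mathbf{r}_1 - \mathbf{r}_2|}|\psi\rangle = \frac{e^2}{4\pi^3\epsilon_0}\left(\frac{Z}{a_0}\right)^6 \int \frac{e^{-2Z(r_1+r_2)/a_0}}{\sqrt{r_1^2 + r_2^2 - 2r_1 r_2\cos\theta_2}}\, r_1^2\,dr_1\,\sin\theta_1\,d\theta_1\,d\phi_1\; r_2^2\,dr_2\,\sin\theta_2\,d\theta_2\,d\phi_2$$

Performing the integration over $\theta_2$ and $\phi_2$ gives

$$\begin{aligned}
\langle\psi|\frac{e^2}{4\pi\epsilon_0|\mathbf{r}_1 - \mathbf{r}_2|}|\psi\rangle
&= \frac{e^2}{4\pi^3\epsilon_0}\left(\frac{Z}{a_0}\right)^6 (2\pi)\int e^{-2Z(r_1+r_2)/a_0}\,\frac{\sqrt{r_1^2 + r_2^2 + 2r_1 r_2} - \sqrt{r_1^2 + r_2^2 - 2r_1 r_2}}{r_1 r_2}\, r_1^2\,dr_1\,\sin\theta_1\,d\theta_1\,d\phi_1\; r_2^2\,dr_2 \\
&= \frac{e^2}{2\pi^2\epsilon_0}\left(\frac{Z}{a_0}\right)^6 \int e^{-2Z(r_1+r_2)/a_0}\,\bigl(r_1 + r_2 - |r_1 - r_2|\bigr)\, r_1\,dr_1\,\sin\theta_1\,d\theta_1\,d\phi_1\; r_2\,dr_2
\end{aligned}$$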


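Now note that $r_1 + r_2 - |r_1 - r_2| = 2r_2$ when $r_1 > r_2$, and $r_1 + r_2 - |r_1 - r_2| = 2r_1$
when $r_1 < r_2$. This can be expressed as $r_1 + r_2 - |r_1 - r_2| = 2\,\mathrm{Min}(r_1, r_2)$,
where the function Min gives the smaller of its two arguments. Furthermore, the
integrand is now independent of $\theta_1$ and $\phi_1$, so integration over those variables gives
$4\pi$, and we get

$$\langle\psi|\frac{e^2}{4\pi\epsilon_0|\mathbf{r}_1 - \mathbf{r}_2|}|\psi\rangle = \frac{4e^2}{\pi\epsilon_0}\left(\frac{Z}{a_0}\right)^6 \int_{r_1=0}^{\infty}\int_{r_2=0}^{\infty} e^{-2Z(r_1+r_2)/a_0}\,\mathrm{Min}(r_1, r_2)\, r_1\,dr_1\, r_2\,dr_2$$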


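where we have now written out the limits of integration for $r_1$ and $r_2$ explicitly,
in order to explain how to deal with the Min function. The presence of the Min
function in the integrand forces us to break the integral over $r_2$ into two pieces:
one for $r_2 < r_1$, for which $\mathrm{Min}(r_1, r_2) = r_2$, and the second for $r_2 > r_1$, for which
$\mathrm{Min}(r_1, r_2) = r_1$. Then the integral becomes

$$\langle\psi|\frac{e^2}{4\pi\epsilon_0|\mathbf{r}_1 - \mathbf{r}_2|}|\psi\rangle = \frac{4e^2}{\pi\epsilon_0}\left(\frac{Z}{a_0}\right)^6 \left[\int_{r_1=0}^{\infty}\int_{r_2=0}^{r_1} e^{-2Z(r_1+r_2)/a_0}\, r_1\,dr_1\, r_2^2\,dr_2 + \int_{r_1=0}^{\infty}\int_{r_2=r_1}^{\infty} e^{-2Z(r_1+r_2)/a_0}\, r_1^2\,dr_1\, r_2\,dr_2\right]$$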








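Performing the two integrations over $r_2$ gives

$$\begin{aligned}
\langle\psi|\frac{e^2}{4\pi\epsilon_0|\mathbf{r}_1 - \mathbf{r}_2|}|\psi\rangle = \frac{4e^2}{\pi\epsilon_0}\left(\frac{Z}{a_0}\right)^6 \Biggl[ &\int_{r_1=0}^{\infty}\left[e^{-4Zr_1/a_0}\left(-\frac{a_0}{2Z}r_1^2 - \frac{a_0^2}{2Z^2}r_1 - \frac{a_0^3}{4Z^3}\right) + e^{-2Zr_1/a_0}\frac{a_0^3}{4Z^3}\right] r_1\,dr_1 \\
&+ \int_{r_1=0}^{\infty} e^{-4Zr_1/a_0}\,\frac{a_0^2}{4Z^2}\left(\frac{2Zr_1}{a_0} + 1\right) r_1^2\,dr_1 \Biggr]
\end{aligned}$$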


and integrating over $r_1$ gives the final result:

$$\langle\psi|\frac{e^2}{4\pi\epsilon_0|\mathbf{r}_1 - \mathbf{r}_2|}|\psi\rangle = \frac{e^2}{4\pi\epsilon_0 a_0}\,\frac{5}{8}Z \tag{10.11}$$
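
The coefficient $5/8$ can be checked numerically. The sketch below is an illustration only: it assumes the reduced double radial integral containing the Min function, rewritten in the dimensionless variables $x = Zr_1/a_0$ and $y = Zr_2/a_0$, and evaluates it with a simple midpoint rule. The grid size and cutoff are arbitrary choices.

```python
import math

def reduced_repulsion_integral(n=600, r_max=12.0):
    """Midpoint-rule estimate of
    I = int_0^inf int_0^inf exp(-2(x+y)) * min(x, y) * x * y dx dy,
    with x = Z*r1/a0 and y = Z*r2/a0 (dimensionless radii).
    The upper limit is truncated at r_max, where the integrand is negligible."""
    h = r_max / n
    pts = [(k + 0.5) * h for k in range(n)]
    # Precompute the separable factor exp(-2x) * x at each grid point.
    weight = [math.exp(-2.0 * x) * x for x in pts]
    total = 0.0
    for i, x in enumerate(pts):
        row = 0.0
        for j, y in enumerate(pts):
            row += weight[j] * min(x, y)
        total += weight[i] * row
    return total * h * h

integral = reduced_repulsion_integral()

# Restoring the prefactor (4 e^2 / (pi eps0)) (Z/a0)^6 and the factor (a0/Z)^5
# from the change of variables gives (e^2 / (4 pi eps0 a0)) * 16 * Z * I.
coefficient = 16.0 * integral

print("I =", integral, " (analytic value 5/128 =", 5.0 / 128.0, ")")
print("coefficient of Z =", coefficient, " (compare with 5/8)")
```

With this grid the numerical coefficient should come out close to $5/8$, which supports Equation (10.11) without repeating the analytic integration.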


Combining the results of Equations (10.9), (10.10), and (10.11), and using the fact
that $\langle\psi|\psi\rangle = 1$, we get

$$\frac{\langle\psi|H|\psi\rangle}{\langle\psi|\psi\rangle} = \frac{e^2}{4\pi\epsilon_0 a_0}\left(Z^2 - 4Z + \frac{5}{8}Z\right) \tag{10.12}$$


Taking the derivative of the right-hand side with respect to $Z$ and setting this
derivative equal to 0 gives the value $Z_{\min}$ for which $\langle\psi|H|\psi\rangle/\langle\psi|\psi\rangle$ is a minimum:

$$2Z_{\min} - \frac{27}{8} = 0$$

so $Z_{\min} = 27/16$. As expected, $1 < Z_{\min} < 2$. Inserting $Z_{\min}$ back into the right-hand
side of Equation (10.12) gives the estimate of the ground-state energy:

$$\frac{\langle\psi|H|\psi\rangle}{\langle\psi|\psi\rangle} = -77.5\ \text{eV}$$

For comparison, the measured ground-state energy of helium (i.e., minus the energy
needed to remove both electrons) is

$$E_0 = -79.0\ \text{eV}$$

As expected, $\langle\psi|H|\psi\rangle/\langle\psi|\psi\rangle > E_0$. However, the error in the value of the ground-state
helium energy from the variational principle is only about 2%, which is an excellent
approximation!
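
The minimization over $Z$ is easy to reproduce numerically. The following sketch assumes Equation (10.12) with $e^2/(4\pi\epsilon_0 a_0) \approx 27.211$ eV (a standard value that the text does not quote explicitly) and uses a plain grid scan in place of the derivative. The function names are illustrative.

```python
HARTREE_EV = 27.211  # e^2 / (4 pi eps0 a0) in eV (assumed standard value)

def helium_energy_ev(z):
    """Equation (10.12) in eV: (e^2/(4 pi eps0 a0)) * (Z^2 - 4Z + (5/8) Z)."""
    return HARTREE_EV * (z**2 - 4.0 * z + 0.625 * z)

def minimize_on_grid(f, lo, hi, steps=100000):
    """Return the grid point in [lo, hi] at which f is smallest."""
    candidates = (lo + (hi - lo) * k / steps for k in range(steps + 1))
    return min(candidates, key=f)

z_min = minimize_on_grid(helium_energy_ev, 1.0, 2.0)
e_min = helium_energy_ev(z_min)

measured_ev = -79.0  # measured ground-state energy quoted in the text
relative_error = abs((e_min - measured_ev) / measured_ev)

print("Z_min =", z_min, " (analytic value 27/16 =", 27.0 / 16.0, ")")
print("E_min =", e_min, "eV")
print("relative error =", relative_error)
```

The scan should land on $Z \approx 27/16$ and an energy near $-77.5$ eV, slightly above the measured value, as the variational principle requires.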



EXERCISES 

10.1 Suppose that the trial wave function $|\psi(a)\rangle$ happens to be exactly equal to the true
ground-state wave function $|\psi_0\rangle$ for some value of $a$. Show that in this case, the
estimate of the ground-state energy given by the variational principle will be equal to
the true ground-state energy.

10.2 Suppose that the trial wave function $|\psi\rangle$ used in the variational principle is orthogonal
to the ground-state wave function of the Hamiltonian: $\langle\psi_0|\psi(a)\rangle = 0$ for all values
of $a$. Show that in this case

$$\frac{\langle\psi|H|\psi\rangle}{\langle\psi|\psi\rangle} \geq E_1$$

where $E_1$ is the energy of the first excited state of $H$.

10.3 (a) In order to use the variational principle to estimate the ground-state energy of
the one-dimensional potential $V(x) = Kx^4$, where $K$ is a constant, which of the
following wave functions would be a better trial wave function?

i. $\psi(x) = e^{-ax^2}$

ii. $\psi(x) = xe^{-ax^2}$

Explain.

(b) In order to use the variational principle to estimate the ground-state energy of
the one-dimensional potential $V(x) = Kx^3$ for $x > 0$ with an infinite potential
barrier at $x = 0$, which of the following wave functions would be a better trial
wave function?

i. $\psi(x) = e^{-ax^2}$

ii. $\psi(x) = xe^{-ax^2}$

Explain.

10.4 A particle of mass $m$ is in the one-dimensional potential given by $V(x) = Kx^3$ for
$x > 0$, where $K$ is a positive constant. There is an infinite potential barrier at $x = 0$,
so $V(0) = \infty$. Use the variational principle with the trial wave function $\psi(x) = xe^{-ax}$
to estimate the ground-state energy.

10.5 Repeat the calculation in Example 10.1 using the trial wave function

$$\psi(x) = xe^{-ax^2}$$

where $a$ is the parameter to be varied. Is the final result a better or a worse
approximation to the true ground-state energy than the result of Example 10.1?

10.6 (a) A particle of mass $m$ is in the one-dimensional potential given by $V(x) = Kx^4$,
where $K$ is a positive constant. Use the variational principle with the trial wave
function $\psi(x) = e^{-ax^2}$ to estimate the ground-state energy.

(b) The true ground-state wave function for this potential is a symmetric function of $x$,
i.e., $\psi_0(-x) = \psi_0(x)$. Use the result of Exercise 10.2, along with an appropriately
chosen trial wave function, to estimate the energy of the first excited state.




10.7 A three-dimensional spherically symmetric harmonic oscillator has the potential
$V(r) = (1/2)Kr^2$. The full Hamiltonian is then

$$H = -\frac{\hbar^2}{2m}\left[\frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial}{\partial r}\right) + \frac{1}{r^2\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right) + \frac{1}{r^2\sin^2\theta}\frac{\partial^2}{\partial\phi^2}\right] + \frac{1}{2}Kr^2$$

[Note that the $L^2$ operator has been written out in terms of derivatives.]

(a) Use the trial wave function $\psi(r) = e^{-ar}$ to calculate an approximation to the
ground-state energy of the harmonic oscillator.

(b) The exact ground-state energy for the three-dimensional harmonic oscillator is
$E = (3/2)\hbar\omega$. What is the relative error in the estimate from part (a)?

10.8 Here is another approach to solve for the ground-state energy of helium.

(a) Begin with the Hamiltonian of Equation (10.5), but neglect the interaction
between the two electrons. Solve the Schrödinger equation in this case to derive the
wave function of the two electrons and the energy.

(b) Now add the interaction of the electrons as a perturbation:

$$H_1 = \frac{e^2}{4\pi\epsilon_0|\mathbf{r}_1 - \mathbf{r}_2|}$$

Use first-order perturbation theory to calculate the change in energy, and add
this change to the energy derived in part (a) to give an estimate for the total
ground-state energy.

(c) Is the estimate in part (b) more accurate or less accurate than the estimate from
the variational principle?

10.9 (a) Singly-ionized lithium has a nucleus of charge $+3e$ and two electrons. Use the
variational principle to estimate the ground-state energy.

(b) Now consider a nucleus of charge $Z_0 e$ with two electrons. Use the variational
principle to estimate the ground-state energy.



CHAPTER

11


Time-Dependent Perturbation
Theory



Although we first encountered the Schrödinger equation in full, time-dependent
form, we have until now been concerned almost exclusively with solutions to the
time-independent Schrödinger equation. There is a good reason for this:
time-dependent problems are more difficult! However, here we return to the full,
time-dependent Schrödinger equation. We will not attempt to solve it exactly; instead,
we will develop a form of perturbation theory that can be applied to time-dependent
problems.

In Chapter 9, we examined what happens if we begin with a Hamiltonian $H_0$
whose eigenfunctions $|\psi_n\rangle$ and energies $E_n$ are known, and then add a small change
in the Hamiltonian which is constant in time: $H = H_0 + H_1$. Here we consider
what happens if the small change $H_1$ is a function of time:

$$H = H_0 + H_1(t)$$

(Here $H_0$ is still taken to be constant in time.) In practice $H_1(t)$ will normally be
produced by a time-dependent change in the potential $V$. Many different types of
time dependence are possible for $H_1(t)$. For example, consider an electron in a
time-dependent electric field. One could imagine suddenly “turning on” the electric
field, applying an oscillating electric field, or producing a very slow change in the
electric field (called an adiabatic change). These three possibilities give the forms
for $H_1(t)$ shown in Figure 11.1. As we shall see, these different time dependences
for $H_1(t)$ produce different predictions for the time evolution of the wave function.

The basic idea, as in Chapter 9, is that a small change in the Hamiltonian will
produce a small change in the wave function. For a time-dependent perturbation,
there can be a nonzero probability that a system initially in a particular eigenstate
of $H_0$ will undergo a transition into a different eigenstate of $H_0$. Our goal will be
to calculate the probability of such a transition.


11.1 ■ DERIVATION OF TIME-DEPENDENT PERTURBATION THEORY 

Before deriving the effect of adding a small perturbation to $H_0$, we need to derive
some general results about the time evolution of the eigenfunctions of $H_0$ itself.
Suppose that at $t = 0$, we have an eigenfunction $|\psi_n\rangle$ of $H_0$ with energy $E_n$. (We
use Dirac notation to indicate that these are general eigenfunctions; they could
be functions of position or spin states represented in column matrix form.) We
will assume for now that $H_0$ is unperturbed. Recall from Chapter 3 that the time
evolution of the eigenstates is given by

$$|\psi_n(t)\rangle = e^{-iE_n t/\hbar}|\psi_n\rangle$$

where $|\psi_n\rangle$ represents the state at $t = 0$. (Although we derived this result only for
wave functions that are taken to be functions of position, the derivation carries
over to the case of abstract states in Dirac notation.)

FIGURE 11.1 Possible forms for the time dependence of $H_1(t)$ include a sharp change
(top), an oscillating perturbation (middle), or an adiabatic change [i.e., a slowly-varying
$H_1(t)$] (bottom).

Now suppose we begin at $t = 0$ in the eigenstate $|\psi_n\rangle$ and we would like to
know the probability $P_n(t)$ that we are still in the same eigenstate at some later
time $t$. This probability is

$$\begin{aligned}
P_n(t) &= |\langle\psi_n|\psi_n(t)\rangle|^2 \\
&= |e^{-iE_n t/\hbar}|^2\,|\langle\psi_n|\psi_n\rangle|^2 \\
&= 1
\end{aligned} \tag{11.1}$$

Thus, a particle in a given eigenstate of $H_0$ remains in that same eigenstate forever.

Now consider a particle in an arbitrary state $|\psi(t)\rangle$, not necessarily an
eigenfunction of $H_0$. Since the eigenfunctions of $H_0$ at $t = 0$ form a basis set, we can
represent the wave function $|\psi(t)\rangle$ at any time $t$ as a linear combination of these
eigenfunctions:

$$|\psi(t)\rangle = \sum_m d_m(t)|\psi_m\rangle \tag{11.2}$$

where the coefficients $d_m(t)$ are functions of time since the wave function $|\psi(t)\rangle$
evolves with time, but the $|\psi_m\rangle$'s are not functions of time. Suppose a particle is
in the state given by $|\psi(t)\rangle$ in Equation (11.2), and a measurement is made to see
if it is in a given eigenstate of $H_0$, namely $|\psi_n\rangle$. Equation (11.2) can be used to
determine the probability $P_n(t)$ of finding the particle in the state $|\psi_n\rangle$:

$$P_n(t) = |\langle\psi_n|\psi(t)\rangle|^2 = \Bigl|\sum_m d_m(t)\langle\psi_n|\psi_m\rangle\Bigr|^2 = |d_n(t)|^2 \tag{11.3}$$

This gives the physical meaning of the expansion in Equation (11.2): the absolute
value squared of the coefficient $d_n(t)$ provides the probability that the particle will
be in the eigenstate $|\psi_n\rangle$ at some time $t$.

Time-dependent perturbation theory allows us to address the following problem.
Suppose a system is initially in some state $|\psi_i\rangle$, which is an eigenstate of $H_0$. At
a later time we make a measurement to determine if it is in some other final
eigenstate $|\psi_f\rangle$ of $H_0$. From Equation (11.1), we know that this probability is 0.
But now we “turn on” a small time-dependent perturbation $H_1(t)$. This means that
the full Hamiltonian is now $H = H_0 + H_1(t)$, and the states $|\psi_i\rangle$ and $|\psi_f\rangle$ are
no longer eigenstates of $H$. Therefore, Equation (11.1) is no longer valid: now
there might be a nonzero probability that the system can begin in the state $|\psi_i\rangle$
and evolve into the state $|\psi_f\rangle$ (where $|\psi_i\rangle$ and $|\psi_f\rangle$ are eigenstates of the original
unperturbed Hamiltonian $H_0$). The calculation of this probability is the main point
of time-dependent perturbation theory.

Consider an arbitrary state $|\psi(t)\rangle$, subject to the initial Hamiltonian $H_0$ plus a
small time-dependent perturbation $H_1$. The time-dependent Schrödinger equation
in this case is

$$[H_0 + H_1(t)]|\psi(t)\rangle = i\hbar\frac{\partial|\psi(t)\rangle}{\partial t} \tag{11.4}$$

We can expand $|\psi(t)\rangle$ in the form given by Equation (11.2). Now, however, we
will define a new set of time-dependent coefficients $c_m(t)$ given by

$$d_m(t) = c_m(t)e^{-iE_m t/\hbar}$$

so that $|\psi(t)\rangle$ is given by

$$|\psi(t)\rangle = \sum_m c_m(t)e^{-iE_m t/\hbar}|\psi_m\rangle \tag{11.5}$$

where $E_m$ are the energies of the eigenstates for the original, unperturbed,
time-independent Hamiltonian $H_0$. There is no deep significance to this change of
variables from $d_m(t)$ to $c_m(t)$; we do it to simplify the algebra. Note that because
$|e^{-iE_m t/\hbar}|^2 = 1$, the probability given in Equation (11.3) can be written as

$$P_n(t) = |c_n(t)|^2$$

so we will be interested in determining $c_n(t)$.

Inserting the expansion given by Equation (11.5) into the Schrödinger equation
[Equation (11.4)] gives

$$\sum_m c_m(t)e^{-iE_m t/\hbar}E_m|\psi_m\rangle + \sum_m H_1(t)c_m(t)e^{-iE_m t/\hbar}|\psi_m\rangle = i\hbar\sum_m \frac{dc_m}{dt}e^{-iE_m t/\hbar}|\psi_m\rangle + i\hbar\sum_m (-iE_m/\hbar)c_m(t)e^{-iE_m t/\hbar}|\psi_m\rangle$$

The first term on the left-hand side and the last term on the right-hand side cancel,
since together they just represent the unperturbed Schrödinger equation, and we
get

$$\sum_m H_1(t)c_m(t)e^{-iE_m t/\hbar}|\psi_m\rangle = i\hbar\sum_m \frac{dc_m}{dt}e^{-iE_m t/\hbar}|\psi_m\rangle$$

Applying $\langle\psi_n|$ to both sides of this equation and recalling that $\langle\psi_n|\psi_m\rangle = \delta_{nm}$
gives

$$\frac{dc_n}{dt} = \frac{1}{i\hbar}\sum_m \langle\psi_n|H_1(t)|\psi_m\rangle c_m(t)e^{i(E_n - E_m)t/\hbar} \tag{11.6}$$


Note that we have made no approximations so far; Equation (11.6) is exact, and in
principle it gives the evolution of all of the coefficients $c_n(t)$. The problem is that
it gives the time derivative of $c_n$ as a function of all of the other coefficients $c_m$; in
many cases this could be an infinite number!

Making further progress requires using the fact that $H_1(t)$ represents a small
perturbation to $H_0$. Suppose that the system begins in an initial eigenstate $|\psi_i\rangle$ for
which $c_i = 1$ and $c_m = 0$ for $m \neq i$, and we would like to calculate the probability
that it evolves into some final eigenstate $|\psi_f\rangle$. If the perturbation is small, then it
is a good approximation to assume that the system does not evolve very far from
its initial state, so that at any later time we still have $c_i \approx 1$ on the right-hand side
of Equation (11.6), while all of the other $c_m$'s are so small in comparison that they
can be ignored. With this approximation (and identifying the state $n$ with the final
state $f$), Equation (11.6) becomes

$$\frac{dc_f}{dt} = \frac{1}{i\hbar}\langle\psi_f|H_1(t)|\psi_i\rangle e^{i(E_f - E_i)t/\hbar}$$

which can be integrated to give $c_f$:

$$c_f = \frac{1}{i\hbar}\int_{t_i}^{t_f} dt\,\langle\psi_f|H_1(t)|\psi_i\rangle e^{i(E_f - E_i)t/\hbar} \tag{11.7}$$


This is the fundamental equation of first-order time-dependent perturbation theory
[the time-dependent analog to Equation (9.19)]. Here $|\psi_i\rangle$ and $|\psi_f\rangle$ represent
eigenstates of the unperturbed Hamiltonian $H_0$ with $i \neq f$. (Note that $|\psi_i\rangle$ and $|\psi_f\rangle$
are assumed to be orthogonal and must be correctly normalized!) The probability
that the system beginning in state $|\psi_i\rangle$ at time $t_i$ will end up in state $|\psi_f\rangle$ at time
$t_f$ when a perturbation $H_1(t)$ is applied is

$$P(i \to f) = |c_f|^2 = \frac{1}{\hbar^2}\left|\int_{t_i}^{t_f} dt\,\langle\psi_f|H_1(t)|\psi_i\rangle e^{i(E_f - E_i)t/\hbar}\right|^2$$
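
To see how good the first-order approximation is, one can compare it with a direct numerical solution of Equation (11.6). The sketch below is illustrative only: it assumes a two-level system with $\hbar = 1$, a real constant matrix element `v` between the two states (zero diagonal elements) switched on at $t = 0$, and a fourth-order Runge-Kutta integrator. None of these choices comes from the text.

```python
import cmath
import math

def derivatives(t, c_i, c_f, v, omega0):
    """Right-hand side of Equation (11.6) for two levels, with hbar = 1.
    omega0 = E_f - E_i and v = <f|H1|i> = <i|H1|f> (real, constant)."""
    dc_i = -1j * v * c_f * cmath.exp(-1j * omega0 * t)
    dc_f = -1j * v * c_i * cmath.exp(1j * omega0 * t)
    return dc_i, dc_f

def exact_probability(v, omega0, t_final, steps=2000):
    """Integrate the coupled equations with RK4, starting from c_i = 1, c_f = 0."""
    h = t_final / steps
    c_i, c_f, t = 1.0 + 0j, 0.0 + 0j, 0.0
    for _ in range(steps):
        k1 = derivatives(t, c_i, c_f, v, omega0)
        k2 = derivatives(t + h / 2, c_i + h / 2 * k1[0], c_f + h / 2 * k1[1], v, omega0)
        k3 = derivatives(t + h / 2, c_i + h / 2 * k2[0], c_f + h / 2 * k2[1], v, omega0)
        k4 = derivatives(t + h, c_i + h * k3[0], c_f + h * k3[1], v, omega0)
        c_i += h / 6 * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0])
        c_f += h / 6 * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1])
        t += h
    return abs(c_f) ** 2

def first_order_probability(v, omega0, t_final, steps=2000):
    """Equation (11.7) with c_i held at 1: midpoint-rule time integral."""
    h = t_final / steps
    integral = sum(v * cmath.exp(1j * omega0 * (k + 0.5) * h) for k in range(steps)) * h
    c_f = integral / 1j
    return abs(c_f) ** 2

p_exact = exact_probability(v=0.05, omega0=1.0, t_final=2.0)
p_first = first_order_probability(v=0.05, omega0=1.0, t_final=2.0)
print("exact:", p_exact, " first order:", p_first)
```

For a weak coupling such as `v = 0.05` the two probabilities should differ by well under one percent; increasing `v` makes the first-order result progressively worse, which is the sense in which the perturbation must be "small."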


It will often be the case that the perturbation $H_1(t)$ can be factored into a
time-independent operator $\mathcal{H}_1$ and a time-dependent piece $f(t)$ which does not operate
on the wave functions:

$$H_1(t) = \mathcal{H}_1 f(t)$$

In this case the inner product in Equation (11.7) can be pulled outside of the time
integral, giving

$$c_f = \frac{1}{i\hbar}\langle\psi_f|\mathcal{H}_1|\psi_i\rangle\int_{t_i}^{t_f} dt\, f(t)e^{i(E_f - E_i)t/\hbar} \tag{11.8}$$

and the transition probability becomes

$$P(i \to f) = \frac{1}{\hbar^2}|\langle\psi_f|\mathcal{H}_1|\psi_i\rangle|^2\left|\int_{t_i}^{t_f} dt\, f(t)e^{i(E_f - E_i)t/\hbar}\right|^2 \tag{11.9}$$

This allows us to make general statements about the time evolution of the system
even if we only know the time behavior of the perturbation $f(t)$ and know nothing
about $\langle\psi_f|\mathcal{H}_1|\psi_i\rangle$.

We now examine how several forms for this time dependence $f(t)$ translate
into the time dependence of the transition probability. Consider first the case of a
step-function perturbation of the form $H_1 = \mathcal{H}_1 f(t)$, in which the perturbation is
“turned on” with constant magnitude at $t = 0$ (Figure 11.1, top). We take $f(t) = 0$
for $t < 0$ and $f(t) = 1$ for $t > 0$. Equation (11.8) gives, for this case,

$$c_f = \frac{1}{i\hbar}\langle\psi_f|\mathcal{H}_1|\psi_i\rangle\int_0^{t_f} dt\, e^{i(E_f - E_i)t/\hbar} \tag{11.10}$$

Note that $E/\hbar$ has units of frequency, so it is convenient to define the frequency
$\omega_0$ given by

$$\omega_0 = (E_f - E_i)/\hbar$$

Then Equation (11.10) integrates to

$$c_f = -\frac{1}{\hbar\omega_0}\langle\psi_f|\mathcal{H}_1|\psi_i\rangle\left(e^{i\omega_0 t_f} - 1\right)$$

Finally, the probability that the system has evolved into the state $|\psi_f\rangle$ at the time
$t_f$ is

$$P(i \to f) = \frac{4}{\hbar^2\omega_0^2}|\langle\psi_f|\mathcal{H}_1|\psi_i\rangle|^2\sin^2\left(\frac{\omega_0 t_f}{2}\right) \tag{11.11}$$

This gives the general time dependence of the transition probability for any
perturbation which is a step function in time. The behavior of $P(i \to f)$ is shown
in Figure 11.2. If $t_f$ is small compared with $1/\omega_0$ so that $\omega_0 t_f \ll 1$, the
transition probability increases as $t_f^2$. Eventually, however, this probability oscillates as
shown.

FIGURE 11.2 The probability $P(i \to f)$ as a function of time $t_f$ for a transition in the
case of a step-function perturbation.

Second, we consider an oscillatory perturbation of the form $H_1 = \mathcal{H}_1\cos(\omega t)$
which is “turned on” at time $t = 0$ (Figure 11.3). Equation (11.8) gives, for this
case,

$$c_f = \frac{1}{i\hbar}\langle\psi_f|\mathcal{H}_1|\psi_i\rangle\int_0^{t_f} dt\,\cos(\omega t)e^{i(E_f - E_i)t/\hbar} \tag{11.12}$$




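FIGURE 11.3 The perturbation $H_1 = \mathcal{H}_1\cos(\omega t)$ is “turned on” at time $t = 0$.

As in the previous example, define the frequency $\omega_0$ to be given by

$$\omega_0 = (E_f - E_i)/\hbar$$

while $\cos(\omega t)$ can be expanded in exponentials as

$$\cos(\omega t) = \frac{e^{i\omega t} + e^{-i\omega t}}{2}$$

so that Equation (11.12) becomes

$$c_f = \frac{1}{i\hbar}\langle\psi_f|\mathcal{H}_1|\psi_i\rangle\int_0^{t_f} dt\,\frac{1}{2}\left(e^{i(\omega_0 + \omega)t} + e^{i(\omega_0 - \omega)t}\right)$$

which integrates to

$$c_f = \frac{1}{i\hbar}\langle\psi_f|\mathcal{H}_1|\psi_i\rangle\,\frac{1}{2}\left[\frac{1}{i(\omega_0 + \omega)}\left(e^{i(\omega_0 + \omega)t_f} - 1\right) + \frac{1}{i(\omega_0 - \omega)}\left(e^{i(\omega_0 - \omega)t_f} - 1\right)\right] \tag{11.13}$$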

This expression is rather complex, but it can be simplified for certain values of
$\omega$. Assume first that $E_f > E_i$, so that $\omega_0 > 0$. Then if $\omega$ is close to $\omega_0$ (which is
where the transition probability is largest), the second term in Equation (11.13)
dominates the first [since, in this case, $1/(\omega_0 - \omega) \gg 1/(\omega_0 + \omega)$]. Dropping the
first term and calculating $|c_f|^2$ to derive the transition probability $P(i \to f)$, we
get

$$P(i \to f) = \frac{1}{4\hbar^2}|\langle\psi_f|\mathcal{H}_1|\psi_i\rangle|^2\,\frac{\sin^2[(\omega_0 - \omega)t_f/2]}{[(\omega_0 - \omega)/2]^2} \tag{11.14}$$

Conversely, if $E_f < E_i$, then $\omega_0 < 0$. In this case the transition probability is
largest when $\omega$ is close to $-\omega_0$, in which case the first term in Equation (11.13)
dominates the second, and we get

$$P(i \to f) = \frac{1}{4\hbar^2}|\langle\psi_f|\mathcal{H}_1|\psi_i\rangle|^2\,\frac{\sin^2[(\omega_0 + \omega)t_f/2]}{[(\omega_0 + \omega)/2]^2} \tag{11.15}$$

FIGURE 11.4 The transition probability $P(i \to f)$ at a fixed time as a function of the
applied frequency $\omega$ for a perturbation which varies as $\cos(\omega t)$.

These general results are applicable to any sinusoidal perturbation independent
of the value of $\langle\psi_f|\mathcal{H}_1|\psi_i\rangle$. Consider the case $\omega_0 > 0$. If we fix the final time $t_f$
and examine how the transition probability varies with the applied frequency $\omega$,
we see that it reaches a maximum at $\omega = \omega_0$; this is similar to the phenomenon of
resonance in classical systems. As $\omega$ moves away from $\omega_0$, the transition probability
decreases in an oscillatory fashion (Figure 11.4).

As we have emphasized, Equations (11.11) and (11.14)-(11.15) give the general
time evolution for any perturbations of the form $H_1(t) = \mathcal{H}_1 f(t)$, when $f(t)$ is
a step function or a sinusoidal function. These cases represent two of the three
types of time dependence shown in Figure 11.1. The third case (an adiabatic, i.e.,
slowly-varying perturbation) is left as an exercise (see Exercise 11.4). Of course,
these three cases do not exhaust all possibilities; many other forms for the time
dependence of $H_1(t)$ are conceivable (see, for example, Exercises 11.1 and 11.2).
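
The closed-form results for the step-function and sinusoidal cases can be compared with a direct numerical evaluation of the time integral in Equation (11.8). The following sketch sets $\hbar = 1$ and uses a midpoint rule. The matrix element `m`, the frequencies, and the final time are arbitrary illustrative values, not taken from the text.

```python
import cmath
import math

def time_integral(f, omega0, t_final, steps=20000):
    """Midpoint estimate of int_0^t_final f(t) * exp(i*omega0*t) dt."""
    h = t_final / steps
    return sum(f((k + 0.5) * h) * cmath.exp(1j * omega0 * (k + 0.5) * h)
               for k in range(steps)) * h

def p_numeric(f, m, omega0, t_final):
    """Transition probability from Equation (11.9) with hbar = 1."""
    return abs(m) ** 2 * abs(time_integral(f, omega0, t_final)) ** 2

def p_step(m, omega0, t_final):
    """Closed form for a step-function perturbation (Equation 11.11)."""
    return 4.0 * abs(m) ** 2 / omega0 ** 2 * math.sin(omega0 * t_final / 2.0) ** 2

def p_near_resonance(m, omega0, omega, t_final):
    """Near-resonance form for a cos(omega*t) perturbation (Equation 11.14)."""
    half = 0.5 * (omega0 - omega)
    return abs(m) ** 2 / 4.0 * (math.sin(half * t_final) / half) ** 2

m, omega0, omega, t_final = 0.1, 2.0, 1.9, 5.0

step_numeric = p_numeric(lambda t: 1.0, m, omega0, t_final)
step_closed = p_step(m, omega0, t_final)

cos_numeric = p_numeric(lambda t: math.cos(omega * t), m, omega0, t_final)
cos_closed = p_near_resonance(m, omega0, omega, t_final)

print("step:", step_numeric, step_closed)
print("cos :", cos_numeric, cos_closed)
```

The step-function values should agree essentially to the accuracy of the quadrature. For the cosine case the numerical integral retains both terms of Equation (11.13), so it differs from the near-resonance form by the dropped term; for the values chosen here the difference is a few percent.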




We now apply these results to two examples. 

Example 11.1. A Hydrogen Atom in an Electric Field Turned on at $t = 0$.

A hydrogen atom is in its ground state. A uniform electric field with magnitude
$\mathcal{E}$ aligned in the positive $z$ direction is turned on at time $t = 0$ and left on. At
some later time, $t_f > 0$, what is the probability that the atom will be in each of the
following excited states?

(a) $n = 2$, $l = 1$, $m_l = -1$

(b) $n = 2$, $l = 1$, $m_l = 0$

(c) $n = 2$, $l = 1$, $m_l = +1$

This perturbation is of the form $H_1(t) = \mathcal{H}_1 f(t)$, with $f(t)$ a step function,
so the transition probability $P(i \to f)$ is given by Equation (11.11). We need
to calculate $\langle\psi_f|\mathcal{H}_1|\psi_i\rangle$. For the hydrogen atom in an electric field, the
time-independent factor in the perturbation, $\mathcal{H}_1$, is

$$\mathcal{H}_1 = e\mathcal{E}z$$

just as in the case of the Stark effect in Chapter 9. Then

$$\begin{aligned}
\langle\psi_f|\mathcal{H}_1|\psi_i\rangle &= e\mathcal{E}\langle\psi_f|z|\psi_i\rangle \\
&= e\mathcal{E}\langle\psi_f|r\cos\theta|\psi_i\rangle
\end{aligned}$$

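The initial wave function is the ground state of hydrogen:

$$|\psi_i\rangle \to \psi_{100}(r, \theta, \phi) = \frac{1}{\sqrt{\pi}}\left(\frac{1}{a_0}\right)^{3/2} e^{-r/a_0}$$

In case (a) the final state is

$$|\psi_f\rangle \to \psi_{21-1}(r, \theta, \phi) = \frac{1}{8\sqrt{\pi}}\left(\frac{1}{a_0}\right)^{3/2}\left(\frac{r}{a_0}\right)e^{-r/2a_0}\sin\theta\, e^{-i\phi}$$

so that

$$\langle\psi_f|\mathcal{H}_1|\psi_i\rangle = \frac{e\mathcal{E}}{8\pi a_0^4}\int_{r=0}^{\infty}\int_{\theta=0}^{\pi}\int_{\phi=0}^{2\pi}(r^2\,dr\,\sin\theta\,d\theta\,d\phi)(r^2 e^{-3r/2a_0})(\sin\theta\cos\theta)e^{i\phi}$$

The integral over $\phi$ gives zero, so the final transition probability is

$$P(i \to f) = 0$$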


Now consider case (b) for which the final state is

$$|\psi_f\rangle \to \psi_{210}(r, \theta, \phi) = \frac{1}{4\sqrt{2\pi}}\left(\frac{1}{a_0}\right)^{3/2}\left(\frac{r}{a_0}\right)e^{-r/2a_0}\cos\theta$$

so that

$$\langle\psi_f|\mathcal{H}_1|\psi_i\rangle = \frac{e\mathcal{E}}{4\sqrt{2}\pi a_0^4}\int_{r=0}^{\infty}\int_{\theta=0}^{\pi}\int_{\phi=0}^{2\pi}(r^2\,dr\,\sin\theta\,d\theta\,d\phi)(r^2 e^{-3r/2a_0})(\cos^2\theta)$$

This can be integrated to give

$$\langle\psi_f|\mathcal{H}_1|\psi_i\rangle = \frac{128\sqrt{2}}{243}e\mathcal{E}a_0$$

Substituting this back into Equation (11.11) gives the final transition probability:

$$P(i \to f) = \frac{131072}{59049}\,\frac{e^2\mathcal{E}^2 a_0^2}{(E_2 - E_1)^2}\sin^2\left(\frac{E_2 - E_1}{2\hbar}t_f\right)$$

Here $E_1$ and $E_2$ are the energies of the ground state and first excited state of
hydrogen, respectively.

In case (c) we get the same answer (for the same reason) as in case (a):

$$P(i \to f) = 0$$
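
The matrix elements in this example can be checked by numerical integration. The sketch below is an illustration under stated assumptions: lengths are measured in units of $a_0$ and the matrix element in units of $e\mathcal{E}a_0$, the radial integral is cut off at $r = 60a_0$, and a composite Simpson rule is used. The helper names are hypothetical.

```python
import cmath
import math

def simpson(f, a, b, n=4000):
    """Composite Simpson rule on [a, b] with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3.0

# Case (b): final state n = 2, l = 1, m_l = 0.
radial = simpson(lambda r: r**4 * math.exp(-1.5 * r), 0.0, 60.0)
polar = simpson(lambda th: math.cos(th) ** 2 * math.sin(th), 0.0, math.pi)
azimuth = 2.0 * math.pi
matrix_element_b = radial * polar * azimuth / (4.0 * math.sqrt(2.0) * math.pi)

# Numerical prefactor of the transition probability: 4 * |matrix element|^2.
probability_prefactor = 4.0 * matrix_element_b ** 2

# Cases (a) and (c): the azimuthal integral of exp(+i*phi) or exp(-i*phi) vanishes.
azimuth_pm1 = simpson(lambda p: cmath.exp(1j * p), 0.0, 2.0 * math.pi)

print("case (b) matrix element:", matrix_element_b,
      " compare 128*sqrt(2)/243 =", 128.0 * math.sqrt(2.0) / 243.0)
print("prefactor:", probability_prefactor, " compare 131072/59049 =", 131072.0 / 59049.0)
print("|azimuthal integral| for m_l = +/-1:", abs(azimuth_pm1))
```

The vanishing azimuthal integral is the numerical counterpart of the statement that cases (a) and (c) give zero transition probability.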


Now consider an example involving spin eigenstates and an oscillating potential. 


Example 11.2. Spins in an Oscillating Magnetic Field.

An electron is in a strong, uniform, constant magnetic field with magnitude $B_0$
aligned in the $-z$ direction. The electron is initially in the state $|\uparrow\rangle$. A weak,
oscillating magnetic field in the $x$ direction of the form

$$\mathbf{B}_1 = B_1\cos(\omega t)\,\hat{x}$$

is turned on at $t = 0$. Calculate the probability that the electron is in the state $|\downarrow\rangle$ at
some later time $t$, assuming we are near resonance, so Equation (11.14) or (11.15)
applies.

Since the constant magnetic field is $\mathbf{B}_0 = -B_0\hat{z}$, the spin-up state $|\uparrow\rangle$ has
energy $E = -\mu_B B_0$, while the spin-down state $|\downarrow\rangle$ has energy $E = +\mu_B B_0$, so
that the initial and final energies are

$$E_i = -\mu_B B_0$$

$$E_f = +\mu_B B_0$$

Then $\omega_0 > 0$, so Equation (11.14) applies with $\omega_0$ given by

$$\omega_0 = (E_f - E_i)/\hbar = 2\mu_B B_0/\hbar$$

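When the perturbation $H_1(t)$ is written as $H_1(t) = \mathcal{H}_1\cos(\omega t)$, the expression for
$\mathcal{H}_1$ is

$$\begin{aligned}
\mathcal{H}_1 &= -\boldsymbol{\mu}\cdot\mathbf{B}_1 \\
&= -B_1\mu_x \\
&= -B_1\left(-\frac{e}{m_e}\right)S_x \\
&= B_1\mu_B\sigma_x
\end{aligned}$$

This gives

$$\langle\psi_f|\mathcal{H}_1|\psi_i\rangle = \langle\downarrow|B_1\mu_B\sigma_x|\uparrow\rangle$$

or in matrix form,

$$\begin{aligned}
\langle\psi_f|\mathcal{H}_1|\psi_i\rangle &= B_1\mu_B\begin{pmatrix}0 & 1\end{pmatrix}\begin{pmatrix}0 & 1\\ 1 & 0\end{pmatrix}\begin{pmatrix}1\\ 0\end{pmatrix} \\
&= B_1\mu_B
\end{aligned}$$

Then we obtain, using Equation (11.14), the final result:

$$P(i \to f) = \frac{B_1^2\mu_B^2}{4\hbar^2}\,\frac{\sin^2[(\omega - \omega_0)t/2]}{[(\omega - \omega_0)/2]^2}$$

with

$$\omega_0 = 2\mu_B B_0/\hbar$$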
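
As a rough numerical illustration of this result, the sketch below evaluates the spin-flip probability for sample field strengths. The constants are standard values for the Bohr magneton and $\hbar$; the field values and time are arbitrary choices made for illustration, and the function names are hypothetical.

```python
import math

MU_B = 5.7883818e-5      # Bohr magneton in eV/T
HBAR = 6.582119569e-16   # reduced Planck constant in eV s

SIGMA_X = [[0, 1], [1, 0]]
UP = [1, 0]
DOWN = [0, 1]

def matrix_element(bra, op, ket):
    """<bra|op|ket> for two-component spin states and a 2x2 matrix."""
    return sum(bra[r].conjugate() * op[r][c] * ket[c]
               for r in range(2) for c in range(2))

def resonance_frequency(b0):
    """omega_0 = 2 mu_B B_0 / hbar, in rad/s, for B_0 in tesla."""
    return 2.0 * MU_B * b0 / HBAR

def spin_flip_probability(b0, b1, omega, t):
    """First-order, near-resonance probability for |up> -> |down>."""
    coupling = b1 * MU_B * matrix_element(DOWN, SIGMA_X, UP)
    half_detuning = 0.5 * (omega - resonance_frequency(b0))
    if half_detuning == 0.0:
        shape = t ** 2          # limit of sin^2(x t)/x^2 as x -> 0
    else:
        shape = (math.sin(half_detuning * t) / half_detuning) ** 2
    return coupling ** 2 / (4.0 * HBAR ** 2) * shape

omega0 = resonance_frequency(1.0)
p_on = spin_flip_probability(1.0, 1e-4, omega0, 1e-9)
p_off = spin_flip_probability(1.0, 1e-4, 1.01 * omega0, 1e-9)
print("omega_0 =", omega0, "rad/s")
print("P on resonance:", p_on, " P at 1% detuning:", p_off)
```

Exactly on resonance the first-order probability grows as $t^2$ without bound, so the formula can only be trusted while the result remains much less than one.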


11.2 ■APPLICATION: SELECTION RULES FOR ELECTROMAGNETIC 
RADIATION 

In Chapter 6 we developed a model for the hydrogen atom. In this picture the
energy levels $E_1, E_2, \ldots, E_n$ correspond to the principal quantum numbers, and
the electron can drop from a higher energy level $E_i$ into a lower energy level $E_f$ by
emitting a photon with angular frequency $\omega = (E_i - E_f)/\hbar$ (Figure 11.5, left). In
Chapter 9 we showed that various internal interactions and corrections split some
of the degenerate states in a given energy level, producing slightly separated energy
levels. However, we still expect that transitions will occur between these energy
levels with the corresponding emission of a photon with the correct energy (Figure
11.5, right). This picture is essentially complete in the sense that all observed
spectral lines correspond to a predicted pair of hydrogen energy levels. However,
the reverse is not true: there are spectral lines predicted by this picture which are
very weak or nonexistent. Apparently, something is preventing some transitions
from occurring; these are called forbidden transitions. For example, the transition
from the state $n = 2$, $l = 0$ to the state $n = 1$, $l = 0$ is not seen to occur via
emission of a single photon. We will now use time-dependent perturbation theory to
understand why. The rules we will derive that determine the allowed and forbidden
transitions are called selection rules.

FIGURE 11.5 Left: The electron can drop from energy level $E_i$ to energy level $E_f$ with
the emission of a photon of angular frequency $\omega = (E_i - E_f)/\hbar$. Right: The actual energy
levels display fine structure, but transitions occur in the same way.

We will actually consider photon absorption rather than emission, since the
calculation for absorption is more straightforward. Since emission is just the time
reverse of absorption, if a given absorption process is allowed, the corresponding
emission process will also be allowed, and the same goes for forbidden processes.
Consider an electromagnetic wave incident upon the electron in a hydrogen atom.
Recall that this wave consists of an oscillating electric field and an oscillating
magnetic field, so the obvious question is: which is more important, the interaction
of the electron with the magnetic field of the wave, or with its electric field?
Classically, recall that the force experienced by an electron in an electric field $\boldsymbol{\mathcal{E}}$
and magnetic field $\mathbf{B}$ is

$$\mathbf{F} = -e[\boldsymbol{\mathcal{E}} + \mathbf{v}\times\mathbf{B}]$$

Further, the magnitudes of the electric and magnetic fields in an electromagnetic
wave satisfy the relation

$$\mathcal{E} = Bc$$

Combining these two equations, we see that the force due to the electric field $F_{\mathcal{E}}$
and the force due to the magnetic field $F_B$ satisfy

$$\frac{F_B}{F_{\mathcal{E}}} \sim \frac{v}{c}$$

which is much less than one as long as the electron is nonrelativistic. This is a
standard result from classical electromagnetism: in examining the interaction of
an electromagnetic wave with matter, it is the electric field of the wave, not its
magnetic field, which dominates the interaction.

The electric field due to an electromagnetic wave propagating in the k direction 
can be written in the form 


$$\boldsymbol{\mathcal{E}} = \boldsymbol{\mathcal{E}}_0 e^{i(\mathbf{k} \cdot \mathbf{r} - \omega t)} = \boldsymbol{\mathcal{E}}_0 e^{i\mathbf{k} \cdot \mathbf{r}} e^{-i\omega t} \qquad (11.16)$$

where $\boldsymbol{\mathcal{E}}_0$ points in a direction perpendicular to $\mathbf{k}$.

The dependence of $\boldsymbol{\mathcal{E}}$ on the position $\mathbf{r}$ is sinusoidal, but we can simplify this
dependence by noting that we will be interested in frequencies for which $kr \ll 1$.
In the hydrogen atom, for example, a transition from one bound state to another
corresponds to a photon energy $E < 13.6$ eV, which corresponds to $\lambda > 1.5 \times 10^{-8}$ m.
In comparison, the typical “size” of the hydrogen atom over which we
need to apply this perturbation is $a_0 \sim 10^{-10}$ m. Thus, it is a good approximation to
take $\boldsymbol{\mathcal{E}}$ to be roughly constant over the radius of the hydrogen atom, corresponding
to $kr \ll 1$ (Figure 11.6).
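As a quick numerical check of the condition kr ≪ 1, the sketch below evaluates k a_0 for a 13.6 eV photon. The constants (hc in eV nm, the Bohr radius) and the helper name are assumptions introduced for this illustration only.

```python
import math

HC_EV_NM = 1239.841984       # h * c in eV * nm
BOHR_RADIUS_NM = 0.0529177   # a_0 in nm


def k_times_a0(photon_energy_ev):
    """Return the dimensionless product k * a_0 for a photon energy in eV."""
    wavelength_nm = HC_EV_NM / photon_energy_ev
    k = 2.0 * math.pi / wavelength_nm   # wave number in 1/nm
    return k * BOHR_RADIUS_NM


ka0 = k_times_a0(13.6)
print(f"k * a_0 at 13.6 eV: {ka0:.5f}")
# Relative sizes of the terms 1, k.r and (k.r)^2 / 2 when exp(i k.r)
# is expanded for r of order a_0.
print([1.0, ka0, ka0 ** 2 / 2.0])
```

Even at the largest photon energy available to a bound-bound transition, k a_0 is a few parts in a thousand, so treating the field as uniform over the atom is a very mild approximation.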



Application: Selection Rules for Electromagnetic Radiation 





FIGURE 11.6 For electromagnetic radiation with wavelength $\lambda \gg a_0$, the electric field
can be taken to be constant in space over the radius of the atom.


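Mathematically, this approximation corresponds to expanding out the exponential
in Equation (11.16) in the form

$$\boldsymbol{\mathcal{E}} = \boldsymbol{\mathcal{E}}_0 e^{-i\omega t}\left[1 + i\mathbf{k}\cdot\mathbf{r} + \frac{1}{2}(i\mathbf{k}\cdot\mathbf{r})^2 + \cdots\right] \qquad (11.17)$$

and retaining only the first term in the expansion:

$$\boldsymbol{\mathcal{E}} = \boldsymbol{\mathcal{E}}_0 e^{-i\omega t}(1) \qquad (11.18)$$

This is called the dipole approximation.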

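Suppose we begin with the hydrogen atom in some initial bound state $|\psi_i\rangle$ and
apply the electric field in Equation (11.18). Then the change in the energy of the
electron due to this electric field is

$$H_1 = e\,\boldsymbol{\mathcal{E}} \cdot \mathbf{r} = e\,\boldsymbol{\mathcal{E}}_0 \cdot \mathbf{r}\, e^{-i\omega t}$$

and the probability that the electron will end up in some other bound state $|\psi_f\rangle$
derived from time-dependent perturbation theory (Equation 11.9) is

$$P(i \to f) = \frac{1}{\hbar^2}\left|\int dt'\, \langle\psi_f|e\,\boldsymbol{\mathcal{E}}_0 \cdot \mathbf{r}|\psi_i\rangle\, e^{-i\omega t'} e^{i(E_f - E_i)t'/\hbar}\right|^2 \qquad (11.19)$$

The time dependence of this probability is irrelevant (which is why the limits of
integration in the time integral have been left unspecified). The important thing is
whether or not the transition can occur at all, which is completely determined by
whether or not the quantity $\langle\psi_f|\boldsymbol{\mathcal{E}}_0 \cdot \mathbf{r}|\psi_i\rangle$ is zero.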

We now take the initial state to be the hydrogen wave function with quantum
numbers $n_i$, $l_i$, and $m_i$ (all of the $m$'s here will refer to $m_l$; we temporarily drop
the $l$ subscript for clarity) and the final state will have quantum numbers $n_f$, $l_f$,
and $m_f$. For now we will take $\boldsymbol{\mathcal{E}}_0$ to be in the $z$ direction, so that

$$\boldsymbol{\mathcal{E}}_0 \cdot \mathbf{r} = \mathcal{E}_0 z = \mathcal{E}_0 r \cos\theta$$





This choice for $\boldsymbol{\mathcal{E}}_0$ indicates that the light is polarized in the $z$ direction. Since the
value of $l$ does not depend on the choice of coordinate axes, we do not expect the
selection rules for $l$ to depend on the choice of polarization. This will not be true,
however, for the selection rules for $m_l$.

The inner product which determines whether or not the transition can occur 
becomes 

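$$\langle\psi_f|\boldsymbol{\mathcal{E}}_0 \cdot \mathbf{r}|\psi_i\rangle = \mathcal{E}_0 \int (r^2\, dr\, \sin\theta\, d\theta\, d\phi)\, R^*_{n_f l_f}(r)\, Y_{l_f}^{m_f *}(\theta, \phi)\, (r\cos\theta)\, R_{n_i l_i}(r)\, Y_{l_i}^{m_i}(\theta, \phi)$$

Consider the integral over $\theta$ and $\phi$:

$$\int (\sin\theta\, d\theta\, d\phi)\, Y_{l_f}^{m_f *}(\theta, \phi)\, (\cos\theta)\, Y_{l_i}^{m_i}(\theta, \phi)$$

It can be shown that the spherical harmonics have the property that

$$Y_l^m(\theta, \phi)\cos\theta = a\, Y_{l+1}^m(\theta, \phi) + b\, Y_{l-1}^m(\theta, \phi)$$

where $a$ and $b$ are constants that depend on the particular values of $l$ and $m$. This
relation allows us to rewrite the angular integral as

$$\int (\sin\theta\, d\theta\, d\phi)\, Y_{l_f}^{m_f *}(\theta, \phi)\left[a\, Y_{l_i+1}^{m_i}(\theta, \phi) + b\, Y_{l_i-1}^{m_i}(\theta, \phi)\right] \qquad (11.20)$$

Finally, recall the orthogonality relation for the spherical harmonics:

$$\int (\sin\theta\, d\theta\, d\phi)\, Y_{l'}^{m' *}(\theta, \phi)\, Y_l^m(\theta, \phi) = 0, \quad \text{unless } l = l' \text{ and } m = m'$$

Applying this orthogonality condition to Equation (11.20), we see that the integral
will vanish (and therefore the transition probability in Equation (11.19) will be
zero) unless

$$l_f = l_i + 1 \quad \text{or} \quad l_f = l_i - 1 \qquad (11.21)$$

and

$$m_f = m_i$$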
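The angular integral can also be checked numerically. The sketch below is a minimal pure-Python illustration, restricted to m ≥ 0 for brevity: it builds the associated Legendre functions by the standard upward recurrence, does the φ integral analytically (it vanishes unless the two m values agree), and integrates over cos θ with Simpson's rule. The function names are hypothetical helpers, not part of the text.

```python
import math


def assoc_legendre(l, m, x):
    """P_l^m(x) for 0 <= m <= l (Condon-Shortley phase), by upward recurrence."""
    pmm = 1.0
    if m > 0:
        root = math.sqrt((1.0 - x) * (1.0 + x))
        odd = 1.0
        for _ in range(m):
            pmm *= -odd * root
            odd += 2.0
    if l == m:
        return pmm
    pmmp1 = x * (2 * m + 1) * pmm
    if l == m + 1:
        return pmmp1
    for ll in range(m + 2, l + 1):
        pll = ((2 * ll - 1) * x * pmmp1 - (ll + m - 1) * pmm) / (ll - m)
        pmm, pmmp1 = pmmp1, pll
    return pmmp1


def norm(l, m):
    """Normalization constant of the spherical harmonic Y_l^m."""
    return math.sqrt((2 * l + 1) / (4 * math.pi)
                     * math.factorial(l - m) / math.factorial(l + m))


def angular_integral(lf, mf, li, mi, steps=2000):
    """Integral of Y_lf^mf* cos(theta) Y_li^mi over the sphere, for m >= 0."""
    if mf != mi:
        return 0.0  # the phi integral of exp(i (mi - mf) phi) vanishes
    m = mi

    def f(x):
        return assoc_legendre(lf, m, x) * x * assoc_legendre(li, m, x)

    h = 2.0 / steps  # Simpson's rule on x = cos(theta) in [-1, 1]; steps is even
    total = f(-1.0) + f(1.0)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * f(-1.0 + k * h)
    return 2.0 * math.pi * norm(lf, m) * norm(li, m) * total * h / 3.0


for lf in range(4):
    print(f"l_i = 1 -> l_f = {lf}: {angular_integral(lf, 0, 1, 0):+.6f}")
```

Starting from l_i = 1, only l_f = 0 and l_f = 2 give a nonzero result, in line with Equation (11.21).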

These are the correct selection rules for light polarized in the $z$ direction, but only
the selection rule for $l$ (Equation 11.21) carries over to arbitrary polarizations, since,
as we have argued, anything involving $l$ must be independent of the coordinate
system. The selection rule we have derived for $m$ applies only to this particular
polarization state; for light polarized in the $x$ or $y$ directions, we obtain instead
(Exercise 11.8)

$$m_f = m_i + 1 \quad \text{or} \quad m_f = m_i - 1$$

The full selection rules then can be expressed in the succinct form

$$\Delta l = \pm 1$$

and

$$\Delta m_l = 0, \pm 1$$

where $\Delta l = l_f - l_i$ and $\Delta m_l = m_f - m_i$. Finally, there is an additional selection
rule for the total angular momentum quantum number $j$:

$$\Delta j = 0, \pm 1 \qquad (11.22)$$

with the single exception that $j_i = 0 \to j_f = 0$ is forbidden. We will not give
a rigorous proof of the $j$ selection rules but rather explain their physical origin.
The photon has spin 1, so if it is absorbed by an atom with initial total angular
momentum quantum number $j_i$, the two angular momenta can couple to give
total angular momentum of $j_i - 1$, $j_i$, or $j_i + 1$, from the rules for adding angular
momentum in Chapter 8. These, therefore, are the possible values of $j_f$, giving the
selection rule in Equation (11.22). The single exception occurs if $j_i = 0$, which
can couple to the spin-1 photon only to give $j_f = 1$. This is why the transition
$j_i = 0$ to $j_f = 0$ is forbidden. (Of course, $j = 0$ never occurs in the hydrogen
atom, but it does occur in multielectron atoms.)


Example 11.3. The Allowed Transitions in Hydrogen from n = 3 to n = 1.

An electron in the $n = 3$ state of hydrogen emits a photon and drops into the ground
state. What are the allowed transitions?

The state $n = 1$ has only one possible value for $l$, namely, $l = 0$, and one possible
value for $j$, namely, $j = 1/2$. In the spectroscopic notation introduced in Chapter
9, this state is written $S_{1/2}$.

The state $n = 3$ has the following possible $l$ and $j$ states:

$l = 0$, $j = 1/2$

$l = 1$, $j = 1/2$, $j = 3/2$

$l = 2$, $j = 3/2$, $j = 5/2$

The selection rule for $l$ tells us that $\Delta l = \pm 1$. Since the final state has $l = 0$, and
$l$ cannot be negative, the only allowed initial state is $l = 1$.

For $l = 1$ in the initial state, we can have either $j = 1/2$ or $j = 3/2$. Since
$j = 1/2$ in the final state, the $j = 1/2$ initial state corresponds to $\Delta j = 0$, and
the $j = 3/2$ initial state corresponds to $\Delta j = -1$. The selection rule for $j$ is
$\Delta j = 0, \pm 1$, so either of these initial $j$ states is allowed.

The allowed transitions, therefore, are

$l = 1$, $j = 1/2 \to l = 0$, $j = 1/2$

$l = 1$, $j = 3/2 \to l = 0$, $j = 1/2$

or, in spectroscopic notation,

$P_{1/2} \to S_{1/2}$

$P_{3/2} \to S_{1/2}$
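The bookkeeping in this example is easy to automate. The following sketch enumerates the (l, j) states of two hydrogen levels and keeps the pairs that satisfy the electric dipole rules stated above. The function names and the label format are illustrative choices made here.

```python
from fractions import Fraction

HALF = Fraction(1, 2)
LETTERS = "SPDFGH"  # spectroscopic letters for l = 0, 1, 2, ...


def lj_states(n):
    """All (l, j) pairs available to hydrogen with principal quantum number n."""
    states = []
    for l in range(n):
        if l > 0:
            states.append((l, l - HALF))
        states.append((l, l + HALF))
    return states


def is_allowed(li, ji, lf, jf):
    """Electric dipole rules: delta l = +-1, delta j = 0, +-1, no j = 0 -> 0."""
    if abs(lf - li) != 1:
        return False
    if abs(jf - ji) > 1:
        return False
    if ji == 0 and jf == 0:
        return False
    return True


def allowed_transitions(n_initial, n_final):
    """List (l_i, j_i, l_f, j_f) for every allowed transition between two levels."""
    return [(li, ji, lf, jf)
            for li, ji in lj_states(n_initial)
            for lf, jf in lj_states(n_final)
            if is_allowed(li, ji, lf, jf)]


def label(l, j):
    return f"{LETTERS[l]}_{j}"


for li, ji, lf, jf in allowed_transitions(3, 1):
    print(f"{label(li, ji)} -> {label(lf, jf)}")
```

Calling `allowed_transitions(3, 1)` reproduces the two transitions found in the example; other pairs of levels, such as those in Exercises 11.9 and 11.10, can be checked the same way.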


Transitions which are not allowed by our selection rules are called forbidden 
transitions, but this is somewhat misleading. Such transitions can sometimes occur 
but at a much slower rate. There are two possible ways of evading the selection 
rules: first, a transition may occur through one of the higher-order terms that were 
dropped in Equation (11.17). For example, a transition occurring through the term
linear in $\mathbf{k} \cdot \mathbf{r}$ is called an electric quadrupole transition. Second, we have ignored
the interaction between the magnetic field of the electromagnetic wave and the
electron in the atom, but this interaction can lead to a magnetic dipole transition
or a higher-order magnetic transition. Therefore, the term “forbidden” transition
really means “forbidden to electric dipole radiation.” 


EXERCISES 

11.1 The electron in a hydrogen atom is initially in the ground state. At $t = 0$, a homogeneous
electric field aligned in the $z$ direction is turned on. The magnitude of the
electric field decreases exponentially:

$$\boldsymbol{\mathcal{E}} = \mathcal{E}_0 \hat{\mathbf{z}}\, e^{-t/\tau}$$

where $\mathcal{E}_0$ and $\tau$ are constants. A measurement is made at $t_f = +\infty$; what is the
probability that the electron will be in the first excited state?

11.2 A system is in an eigenstate $|\psi_i\rangle$ with energy $E_i$. The perturbation

$$H_1 e^{-a^2 t^2}$$

is turned on at $t_i = -\infty$ and left on until $t_f = +\infty$. Here $H_1$ is independent of
time, and $a$ is a constant. Show that at $t_f = +\infty$, the probability that the system has
evolved into the eigenstate $|\psi_f\rangle$ with energy $E_f$ is

$$P(i \to f) = \frac{\pi}{\hbar^2 a^2}\, |\langle\psi_f|H_1|\psi_i\rangle|^2\, e^{-(E_f - E_i)^2/2\hbar^2 a^2}$$





11.3 An electron is in a strong, uniform, constant magnetic field with magnitude $B_0$ aligned
in the $+x$ direction. The electron is initially in the state $|{\rightarrow}\rangle$ with $x$ component of spin
equal to $+\hbar/2$. A weak, uniform, constant magnetic field of magnitude $B_1$ (where
$B_1 \ll B_0$) in the $+z$ direction is turned on at $t = 0$ and turned off at $t = t_0$. Let
$P(i \to f)$ be the probability that the electron is in the state $|{\leftarrow}\rangle$ with $x$ component
of spin equal to $-\hbar/2$ at a later time $t_f > t_0$. Show that

$$P(i \to f) = (B_1/B_0)^2 \sin^2(\mu_B B_0 t_0/\hbar)$$


11.4 Consider a time-dependent perturbation $H_1(t)$ which is adiabatic, i.e., slowly varying.
The system is initially in the state $|\psi_i\rangle$ at $t_i = -\infty$. The potential is turned on,
and we wish to derive the probability that the system will be in the state $|\psi_f\rangle$ at
some later time $t_f$. Write down the standard expression for $c_f$ in this case and use
integration by parts to break the expression into two terms, one of which contains
$dH_1/dt$. Since $H_1$ is slowly varying, this term may be taken to be 0. Then use the
fact that $H_1(-\infty) = 0$ to derive the final expression for the transition probability:

$$P(i \to f) = \frac{|\langle\psi_f|H_1(t_f)|\psi_i\rangle|^2}{(E_i - E_f)^2}$$

11.5 An electron is in a strong, static, homogeneous magnetic field with magnitude $B_0$ in
the $z$ direction. At time $t = 0$, the spin of the electron is in the $+z$ direction. At $t = 0$
a weak, homogeneous magnetic field with magnitude $B_1$ (where $B_1 \ll B_0$) is turned
on. At $t = 0$ this field is pointing in the $x$ direction, but it rotates counterclockwise
in the x-z plane with angular frequency $\omega$, so that at any later time $t$ this field is at
an angle $\omega t$ relative to the $x$-axis:



Calculate the probability that at a later time $t_f$ the electron spin has flipped to the
$-z$ direction. Do not assume anything about the particular value of $\omega$.

11.6 A particle with mass $m$ is in a one-dimensional infinite square-well potential of width
$a$, so $V(x) = 0$ for $0 < x < a$, and there are infinite potential barriers at $x = 0$ and
$x = a$. Recall that the normalized solutions to the Schrödinger equation are

$$\psi_n(x) = \sqrt{\frac{2}{a}}\, \sin\left(\frac{n\pi x}{a}\right)$$

with energies

$$E_n = \frac{n^2\pi^2\hbar^2}{2ma^2}$$

where $n = 1, 2, 3, \ldots$

The particle is initially in the ground state. A delta-function perturbation

$$H_1 = K\delta\left(x - \frac{a}{2}\right)$$

(where $K$ is a constant) is turned on at time $t = 0$ and turned off at $t = t_1$. A
measurement is made at some later time $t_2$, where $t_2 > t_1$.

(a) What is the probability that the particle will be found to be in the excited state
$n = 3$?

(b) There are some excited states $n$ in which the particle will never be found, no
matter what values are chosen for $t_1$ and $t_2$. Which excited states are these?

11.7 A hydrogen atom is in the ground state. At $t = 0$ an electric field with magnitude
$\mathcal{E}$ is turned on. At $t = 0$ the electric field points in the $x$ direction, and it rotates
counterclockwise in the x-y plane with angular frequency $\omega$ (i.e., at any later time $t$
the field is oriented at an angle $\omega t$ relative to the $x$-axis). This rotating field causes
the atom to undergo a transition to an $n = 2$ state. Determine which of the $l$, $m_l$ states
are possible final states and which are impossible.

11.8 (a) Consider an electromagnetic wave polarized in the $x$ direction, incident on a
hydrogen atom. Show that in this case, the selection rules for $m_l$ are

$$\Delta m_l = \pm 1$$

(b) Repeat this calculation for an electromagnetic wave polarized in the $y$ direction.

11.9 A hydrogen atom in the $n = 4$ state emits electric dipole radiation and drops into
the $n = 3$ state. Determine all possible transitions in terms of their initial and final
values for $l$ and $j$. Express the answer in spectroscopic notation.

11.10 (a) The electron in a hydrogen atom is initially in the state $n = 5$, $l = 0$, $j = 1/2$.
The atom emits electric dipole radiation and drops into an $n = 3$ state. Determine
all $l$, $j$ states which are possible final states.

(b) An electron in a hydrogen atom is initially in the state $n = 5$, $l = 2$, $j = 5/2$. It
emits electric dipole radiation and drops into the state $n = 4$, $l = l_1$, $j = j_1$. From
this state, it emits electric dipole radiation again and drops into the hydrogen
ground state. Determine $l_1$ and $j_1$.

11.11 An electron is contained in a three-dimensional rectangular box given by $0 < x < a$,
$0 < y < b$, and $0 < z < c$. The solutions of the Schrödinger equation are specified
by the quantum numbers $n_x$, $n_y$, and $n_z$. Recall that the normalized wave function is

$$\psi(x, y, z) = \sqrt{\frac{8}{abc}}\, \sin\left(\frac{n_x \pi x}{a}\right) \sin\left(\frac{n_y \pi y}{b}\right) \sin\left(\frac{n_z \pi z}{c}\right)$$

with energy

$$E = \frac{\hbar^2\pi^2}{2m}\left(\frac{n_x^2}{a^2} + \frac{n_y^2}{b^2} + \frac{n_z^2}{c^2}\right)$$

where $n_x$, $n_y$, and $n_z$ are positive integers. The electron is initially in the state
$n_x$, $n_y$, $n_z$. An electromagnetic wave is incident polarized in the $y$ direction, so that
the electric field vector is given by:

$$\boldsymbol{\mathcal{E}} = \boldsymbol{\mathcal{E}}_0 e^{i(\mathbf{k} \cdot \mathbf{r} - \omega t)}$$

where $\boldsymbol{\mathcal{E}}_0$ is a constant vector in the $y$ direction. Use the dipole approximation and
time-dependent perturbation theory to derive the selection rules for the electron to
absorb the radiation and end up in the final state $n_x'$, $n_y'$, $n_z'$.




CHAPTER 


12 


Scattering Theory 


In this chapter, we return to the theory of scattering, i.e., the behavior of a particle 
incident on a fixed potential. The one-dimensional case was examined for step- 
function potentials in Chapter 4. In that case it was possible to solve the Schrodinger 
equation exactly. Here we extend the discussion to the case of three-dimensional 
potentials. In most cases the Schrodinger equation for scattering in three dimensions
cannot be solved exactly, so approximation methods must be used. This
chapter deals with two of these approximation methods: the Born approximation
and the method of partial waves. The Born approximation can be applied when
the energy of the incident particle is much larger than the potential from which it 
scatters. While the method of partial waves is applicable to any scattering problem, 
it takes a particularly simple form in the limit where the energy of the incident 
particle is low. 

12.1 ■ DEFINITION OF THE CROSS SECTION

Before examining these quantum mechanical approximations, we will review some 
of the concepts of scattering from classical mechanics. Suppose that a region of 
space is filled with targets having a number density n. A particle enters this region 
and travels a distance L (Figure 12.1). Clearly, the probability P that the particle 
will strike one of the targets is proportional to the distance L that it travels, and it 
is also proportional to the number density of targets $n$, so it is possible to write

$$P = nL\sigma \qquad (12.1)$$

where $\sigma$ is a constant of proportionality that depends on the nature of the targets.
Since $P$ is a probability, it must be a dimensionless number, while $n$ has units of
1/length$^3$ and $L$ has units of length. Hence, $\sigma$ must have units of length$^2$, or area.
The quantity $\sigma$ is called the cross section for scattering. In a classical scattering
problem in which the incident particle physically collides with the targets, $\sigma$ is
just the cross-sectional area of each target, i.e., the area that the incident particle
“sees” head-on. This is not the case (even classically) when the incident particle 
and the targets interact via a long-range force. 

Now suppose that instead of a single particle, a large number of incident particles
$N_i$ all travel a distance $L$ through this region of targets, and let $N_s$ be the number of
these particles that scatter off of one of the targets. In this case $P$ gives the fraction
of particles that scatter, so that

$$P = N_s/N_i$$

which implies that

$$N_s = N_i n L \sigma \qquad (12.2)$$

FIGURE 12.1 A particle moves a distance $L$ through a region of space having a number
density of targets $n$.

FIGURE 12.2 A spherical coordinate system with the polar axis along the direction of
motion of the incident particle.

Often, however, it is important to know not only the total probability for scattering
but also the probability that the incident particles are scattered in a particular
direction. In order to quantify this idea, we set up a spherical coordinate system
with the polar axis aligned along the direction of motion of the incident particle
(Figure 12.2). Consider a small solid angle of size $d\Omega$ in the $\theta$, $\phi$ direction:

$$d\Omega = \sin\theta\, d\theta\, d\phi$$




FIGURE 12.3 The differential cross section, $d\sigma/d\Omega$, determines the probability that an
incident particle will be scattered in the $\theta$, $\phi$ direction.

If $N_s$ is the total number of particles scattered (as in Equation 12.2), let $dN_s$ be
the total number of particles scattered into the small solid angle $d\Omega$ in the $\theta$, $\phi$
direction, where $dN_s$ will be a function of the scattering direction and will be
proportional to $d\Omega$. Then the equation analogous to Equation (12.2) is

$$dN_s = N_i n L\, \frac{d\sigma}{d\Omega}\, d\Omega$$

Thus, $d\sigma/d\Omega$, which is a function of $\theta$ and $\phi$, determines the probability that
the incident particle will be scattered in the $\theta$, $\phi$ direction (Figure 12.3). For
instance, $\theta = 0$ corresponds to no scattering, while $\theta = \pi$ represents scattering
directly backwards. The quantity $d\sigma/d\Omega$ is called the differential cross section
and, therefore, $\sigma$ is often called the total cross section. Given $d\sigma/d\Omega$, the total
cross section is just the integral of the differential cross section over all angles:

$$\sigma = \int\!\!\int \left(\frac{d\sigma}{d\Omega}\right) \sin\theta\, d\theta\, d\phi$$

It will frequently be the case that the scattering potential is spherically symmetric.
In this case the system as a whole (incident particle plus scattering potential)
has azimuthal symmetry, i.e., there is no preferred $\phi$ direction. On the other hand,
while the potential is symmetric with respect to $\theta$, the system as a whole is not,
since the direction of the incident particle defines a special direction with respect to
the $\theta$ coordinate. Thus, for a spherically-symmetric scattering potential, we expect
$d\sigma/d\Omega$ to depend on $\theta$ but to be independent of $\phi$.

As an example of how these ideas work, we will now calculate a classical 
scattering cross section; this is not a quantum mechanical calculation! 


Example 12.1. Classical Scattering from a Hard Sphere. 

Suppose that we are shooting particles at a solid sphere of radius a, where the 
size of the incident particles is negligible compared to the size of the sphere. The 
particles reflect off of the sphere elastically, so that the angle of incidence equals 
the angle of reflection. We now calculate the differential cross section, $d\sigma/d\Omega$,
and the total cross section $\sigma$.




FIGURE 12.4 A particle scatters from a hard sphere of radius $a$. The scattering angle is
$\theta$, and the angle of incidence and angle of reflection are both $\alpha$.

FIGURE 12.5 The particle scatters off of a cross-sectional area consisting of a ring of
radius $s$, width $ds$: (left) side view, (right) head-on view.

As usual, let $\theta$ be the scattering angle for the particle. When the particle scatters,
the angle of reflection equals the angle of incidence; we let $\alpha$ be both of these angles
(Figure 12.4). The particle scatters off of a cross-sectional area consisting of a ring
of radius $s$ and width $ds$ (Figure 12.5). The cross-sectional area of this ring is

$$d\sigma = 2\pi s\, ds$$

and from Figure 12.5, we have

$$s = a \sin\alpha$$







These two equations give $d\sigma$ in terms of $\alpha$:

$$d\sigma = (2\pi)(a \sin\alpha)(a \cos\alpha\, d\alpha) \qquad (12.3)$$

However, we want to express everything in terms of the scattering angle $\theta$. Note
from Figure 12.4 that

$$\theta = \pi - 2\alpha$$

Substituting this into Equation (12.3) gives

$$d\sigma = (2\pi)a^2[(\sin\theta)/2](d\theta/2)$$

Note that if the particle strikes this area $d\sigma$, it scatters into the solid angle $d\Omega =
(\sin\theta\, d\theta)(2\pi)$, where the $2\pi$ comes from the fact that we have integrated over $\phi$.
Then the differential cross section is

$$\frac{d\sigma}{d\Omega} = \frac{2\pi a^2[(\sin\theta)/2](d\theta/2)}{2\pi \sin\theta\, d\theta} = \frac{a^2}{4}$$

Thus, the differential cross section in this case is independent of both $\theta$ and $\phi$.
The total cross section is

$$\sigma = \int_{\theta=0}^{\pi}\int_{\phi=0}^{2\pi} \left(\frac{d\sigma}{d\Omega}\right) \sin\theta\, d\theta\, d\phi = \int_{\theta=0}^{\pi}\int_{\phi=0}^{2\pi} \frac{a^2}{4}\, \sin\theta\, d\theta\, d\phi = \pi a^2$$

This result makes sense physically because $\pi a^2$ is the cross-sectional area of the
sphere, i.e., a particle striking the sphere sees, in projection, a circle of radius $a$
and area $\pi a^2$.
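The same result can be recovered numerically from the geometry alone. The sketch below takes the impact parameter as a function of scattering angle, s = a sin α with α = (π − θ)/2, and forms the ratio of the ring area 2πs ds to the solid angle 2π sin θ dθ using a finite-difference derivative. The function names and step sizes are choices made for this illustration.

```python
import math


def impact_parameter(theta, a):
    """Impact parameter s for scattering angle theta off a hard sphere of radius a."""
    alpha = (math.pi - theta) / 2.0   # angle of incidence
    return a * math.sin(alpha)


def dsigma_domega(theta, a, h=1e-6):
    """d(sigma)/d(Omega) = s |ds/dtheta| / sin(theta), derivative by central difference."""
    s = impact_parameter(theta, a)
    ds_dtheta = (impact_parameter(theta + h, a)
                 - impact_parameter(theta - h, a)) / (2.0 * h)
    return s * abs(ds_dtheta) / math.sin(theta)


def total_cross_section(a, steps=2000):
    """Integrate d(sigma)/d(Omega) over all angles (midpoint rule in theta)."""
    dtheta = math.pi / steps
    total = 0.0
    for k in range(steps):
        theta = (k + 0.5) * dtheta
        total += dsigma_domega(theta, a) * math.sin(theta) * dtheta
    return 2.0 * math.pi * total   # the phi integral contributes 2 * pi


a = 2.0
for theta in (0.3, 1.5, 2.8):
    print(f"theta = {theta}: dsigma/dOmega = {dsigma_domega(theta, a):.6f}")
print(f"total = {total_cross_section(a):.6f}, pi a^2 = {math.pi * a * a:.6f}")
```

For every angle the differential cross section comes out equal to a²/4 to within the accuracy of the finite difference, and the integral reproduces πa².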


The general scattering problem in quantum mechanics is the calculation of
$d\sigma/d\Omega$ for a particle incident on an arbitrary potential $V$. In principle, the
Schrödinger equation should be solved and the resulting wave function used to
find the cross section. In practice, the Schrödinger equation for scattering problems
is usually difficult to solve, so approximation methods must be used. There
are two cases for which the problem is much simpler. In the limit in which the
energy of the particle $E$ is much larger than the potential $V$ we can use a variant
of perturbation theory called the Born approximation. In the opposite limit, when
$E \ll V$, we utilize the method of partial waves. We now discuss these in turn.



FIGURE 12.6 A particle scatters from the potential $V(\mathbf{r})$ with initial wave vector $\mathbf{k}_i$ and
final wave vector $\mathbf{k}_f$.


12.2 ■ THE BORN APPROXIMATION


Consider an incident particle with energy E scattering off of a potential V(r). 
(Although the potential will often be taken to have spherical symmetry, this is 
not essential in using the Born approximation, so we begin with the most general 
case.) We assume that the potential is “weak” in the sense that $E \gg V(\mathbf{r})$, which
allows us to use the results of time-dependent perturbation theory from the previous
chapter. We will assume further that $V(\mathbf{r}) \to 0$ when $r \to \infty$ so that the potential
can be ignored for sufficiently large r. 

Assume that the incident particle has wave vector $\mathbf{k}_i$, and it scatters with final
wave vector $\mathbf{k}_f$ (Figure 12.6). These correspond to initial and final energies of

$$E_i = \frac{\hbar^2 k_i^2}{2m}, \qquad E_f = \frac{\hbar^2 k_f^2}{2m}$$

We have assumed that the potential vanishes for large $r$, so at locations far from the
potential the time-independent wave functions correspond to free particles. These
time-independent wave functions (from Chapter 4) are

$$\psi_i = e^{i\mathbf{k}_i \cdot \mathbf{r}}$$

$$\psi_f = e^{i\mathbf{k}_f \cdot \mathbf{r}}$$

The scattering potential is assumed to be constant in time. However, we can think 
of this problem in terms of time-dependent perturbation theory. Initially, when the 
particle is far from the potential, it “sees” V = 0. Then, as the particle enters the 
region where $V \neq 0$, the particle experiences the potential and scatters. Finally, as
the particle leaves the region of the potential, it once again “sees” $V = 0$. Hence,
we will assume that the potential is “turned on” at some arbitrary time, which we
will take to be $t = 0$, and we wish to know the probability that the particle has
scattered from the state $\psi_i$ to $\psi_f$ by the time $t$. Formally, since the potential is
being treated as a small perturbation, this probability is just given by the result of 



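time-dependent perturbation theory from Chapter 11:

$$P(i \to f) = \frac{1}{\hbar^2}\left|\int_0^t dt'\, \langle\psi_f|V(\mathbf{r})|\psi_i\rangle\, e^{i(E_f - E_i)t'/\hbar}\right|^2$$

$$= \frac{1}{\hbar^2}\, |\langle\psi_f|V(\mathbf{r})|\psi_i\rangle|^2 \left|\int_0^t dt'\, e^{i(E_f - E_i)t'/\hbar}\right|^2$$

$$= \frac{1}{\hbar^2}\, |\langle\psi_f|V(\mathbf{r})|\psi_i\rangle|^2 \left|\frac{\hbar}{i(E_f - E_i)}\, e^{i(E_f - E_i)t'/\hbar}\Big|_0^t\right|^2$$

$$= \frac{1}{\hbar^2}\, |\langle\psi_f|V(\mathbf{r})|\psi_i\rangle|^2\, \frac{4\hbar^2}{(E_f - E_i)^2}\, \sin^2\left(\frac{(E_f - E_i)t}{2\hbar}\right) \qquad (12.4)$$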

Although this expression is formally correct, there are several further steps before 
this result can be used to derive an actual scattering cross section. 
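The time integral behind Equation (12.4) is easy to verify numerically. The sketch below compares a midpoint-rule evaluation of the squared modulus of the integral of exp(iωt') from 0 to t with the closed form 4 sin²(ωt/2)/ω², where ω stands for (E_f − E_i)/ħ. The function names and the sample values of ω and t are arbitrary choices for illustration.

```python
import cmath
import math


def time_factor_numeric(omega, t, steps=20000):
    """|integral_0^t exp(i omega t') dt'|^2 by the midpoint rule."""
    dt = t / steps
    total = 0j
    for i in range(steps):
        total += cmath.exp(1j * omega * (i + 0.5) * dt) * dt
    return abs(total) ** 2


def time_factor_closed(omega, t):
    """Closed form 4 sin^2(omega t / 2) / omega^2."""
    return 4.0 * math.sin(omega * t / 2.0) ** 2 / omega ** 2


for omega, t in ((3.0, 2.0), (0.5, 10.0), (7.0, 1.3)):
    numeric = time_factor_numeric(omega, t)
    closed = time_factor_closed(omega, t)
    print(f"omega = {omega}, t = {t}: numeric = {numeric:.8f}, closed = {closed:.8f}")
```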

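First, the wave functions must be normalized. This represents a problem, since
a function of the form $e^{i\mathbf{k} \cdot \mathbf{r}}$ does not have a well-defined integral over all of space.
To resolve this, we put a box of volume $V$ around the incident and scattered wave
functions. This may seem like a bit of a fraud, but as long as the volume of the
box is much larger than the system under consideration, it cannot affect the final
results. In this case the normalized wave functions become

$$\psi_i = \frac{1}{\sqrt{V}}\, e^{i\mathbf{k}_i \cdot \mathbf{r}}, \qquad \psi_f = \frac{1}{\sqrt{V}}\, e^{i\mathbf{k}_f \cdot \mathbf{r}}$$

since, for example,

$$\int_V d^3r\, \psi_i^* \psi_i = \int_V d^3r\, \frac{1}{\sqrt{V}}\, e^{-i\mathbf{k}_i \cdot \mathbf{r}}\, \frac{1}{\sqrt{V}}\, e^{i\mathbf{k}_i \cdot \mathbf{r}} = 1$$

Thus, the inner product in Equation (12.4) can be written as

$$\langle\psi_f|V(\mathbf{r})|\psi_i\rangle = \int_V d^3r\, \frac{1}{\sqrt{V}}\, e^{-i\mathbf{k}_f \cdot \mathbf{r}}\, V(\mathbf{r})\, \frac{1}{\sqrt{V}}\, e^{i\mathbf{k}_i \cdot \mathbf{r}} = \frac{1}{V}\int_V d^3r\, V(\mathbf{r})\, e^{i(\mathbf{k}_i - \mathbf{k}_f) \cdot \mathbf{r}}$$

and the transition probability from Equation (12.4) becomes

$$P(i \to f) = \frac{4}{(E_f - E_i)^2}\, \sin^2\left(\frac{(E_f - E_i)t}{2\hbar}\right) \frac{1}{V^2}\left|\int_V d^3r\, V(\mathbf{r})\, e^{i(\mathbf{k}_i - \mathbf{k}_f) \cdot \mathbf{r}}\right|^2 \qquad (12.5)$$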

Now we come to another problem: Equation (12.5) gives the transition probability
from a specific initial wave vector $\mathbf{k}_i$ with energy $\hbar^2 k_i^2/2m$ to a specific
final wave vector $\mathbf{k}_f$ with energy $\hbar^2 k_f^2/2m$. In a physical scattering problem, it
is certainly possible to control the initial energy, but not the final energy. Hence,
Equation (12.5) must be integrated over all possible final energies $E_f$. In doing
this, however, there is an additional complication: there are more quantum states
corresponding to higher energies than to lower ones. For instance, recall from
Chapter 6 that a particle in a cubic box of side $a$ has an energy given by

$$E = \frac{\hbar^2\pi^2}{2ma^2}\left(n_x^2 + n_y^2 + n_z^2\right)$$

FIGURE 12.7 The number of states with energies between $E$ and $E + dE$ is given by
1/8 of the volume of a thin spherical shell with radius $n$, where $E = (\hbar^2\pi^2/2ma^2)n^2$.

If the energy is fixed to lie in the small range between $E$ and $E + dE$, then the
number of different values of $n_x$, $n_y$, and $n_z$ that produce an energy in this range
increases as $E$ increases. To quantify this, consider the three-dimensional “space”
defined by $n_x$, $n_y$, and $n_z$ (Figure 12.7). We now fix the energy $E$ and calculate
how many states have energies between $E$ and $E + dE$. This number is given by
1/8 of the volume of a thin spherical shell with radius $n$, where

$$n^2 = n_x^2 + n_y^2 + n_z^2$$

and the factor of 1/8 arises because the physical states all lie in the octant defined
by $n_x > 0$, $n_y > 0$, and $n_z > 0$. If $N(E)dE$ gives this number of states, we get



( 12 . 8 ) 



FIGURE 12.8 In the limit $a \to \infty$, the function $\sin^2(ax)/x^2$ becomes sharply peaked and
can be approximated as a multiple of the delta function.


and using Equation (12.8) to rewrite the right-hand side of Equation (12.7) in terms
of $E$ rather than $n$, we get

$$N(E)\,dE = \frac{V(2m)^{3/2}}{4\pi^2\hbar^3}\, E^{1/2}\, dE \qquad (12.9)$$

In this equation, $N(E)$ is called the density of states. Physically, Equation (12.9)
says that there are more ways for a system to have a large energy than a small 
energy (and the density of states scales as the square root of the energy). 
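The counting argument can be tested directly. The sketch below counts the triples of positive integers with n_x² + n_y² + n_z² ≤ n_max² and compares the total with 1/8 of the volume of a sphere of radius n_max, which is what Equation (12.9) gives when integrated from zero energy up to the corresponding E. The function names are illustrative; the agreement is only asymptotic, since states near the coordinate planes are undercounted at small n_max.

```python
import math


def count_states(n_max):
    """Count positive-integer triples with n_x^2 + n_y^2 + n_z^2 <= n_max^2."""
    limit = n_max * n_max
    count = 0
    for nx in range(1, n_max + 1):
        for ny in range(1, n_max + 1):
            remaining = limit - nx * nx - ny * ny
            if remaining < 1:
                break                       # larger ny only makes it worse
            count += math.isqrt(remaining)  # number of allowed n_z >= 1
    return count


def octant_volume(n_max):
    """One eighth of the volume of a sphere of radius n_max."""
    return (1.0 / 8.0) * (4.0 / 3.0) * math.pi * n_max ** 3


for n_max in (10, 40, 100):
    exact = count_states(n_max)
    estimate = octant_volume(n_max)
    print(f"n_max = {n_max}: counted {exact}, octant volume {estimate:.0f}, "
          f"ratio {exact / estimate:.3f}")
```

The ratio approaches one as n_max grows, which is the regime relevant to a box much larger than the scattering system.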

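Now Equation (12.5) can be multiplied by $N(E_f)\,dE_f$ and integrated over the
final possible energies. However, we also have to include the fact that the density
of final states is proportional to $d\Omega_f/4\pi$. Hence, the transition probability $dP$ for
scattering into a small solid angle $d\Omega_f$ is

$$dP(i \to \text{any } E_f) = \int P(i \to f)\, N(E_f)\, dE_f\, \frac{d\Omega_f}{4\pi}$$

$$= \int \frac{4}{(E_f - E_i)^2}\, \sin^2\left(\frac{(E_f - E_i)t}{2\hbar}\right) \frac{1}{V^2}\left|\int_V d^3r\, V(\mathbf{r})\, e^{i(\mathbf{k}_i - \mathbf{k}_f) \cdot \mathbf{r}}\right|^2 \frac{V(2m)^{3/2}}{4\pi^2\hbar^3}\, E_f^{1/2}\, dE_f\, \frac{d\Omega_f}{4\pi} \qquad (12.10)$$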


Note the factor $\sin^2[(E_f - E_i)t/2\hbar]/(E_f - E_i)^2$ in Equation (12.10). The generic
function $\sin^2(ax)/x^2$ becomes arbitrarily sharply peaked as $a \to \infty$ (Figure 12.8).
More rigorously,

$$\frac{\sin^2 ax}{x^2} \to \pi a\, \delta(x), \quad \text{as } a \to \infty \qquad (12.11)$$
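The coefficient πa in Equation (12.11) is the area under sin²(ax)/x². A minimal numerical check follows, using a midpoint rule over a wide but finite interval; the interval half-width and step count are arbitrary choices, and the neglected tails contribute roughly one part in a thousand or less for the values of a used here.

```python
import math


def integral_sin2_over_x2(a, half_width=200.0, steps=200000):
    """Midpoint-rule integral of sin^2(a x) / x^2 over [-half_width, half_width]."""
    h = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps):
        x = -half_width + (i + 0.5) * h   # midpoints never land on x = 0
        total += (math.sin(a * x) / x) ** 2 * h
    return total


for a in (5.0, 10.0):
    area = integral_sin2_over_x2(a)
    print(f"a = {a}: area = {area:.4f}, pi * a = {math.pi * a:.4f}")
```

As a increases the area grows in proportion to a while the central peak narrows, which is the sense in which the function tends to πa δ(x).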




Since we are interested in the behavior of this system in the limit where $t$ becomes
large, we can use Equation (12.11) to write

$$\frac{\sin^2[(E_f - E_i)t/2\hbar]}{(E_f - E_i)^2} \to \pi(t/2\hbar)\, \delta(E_f - E_i)$$

When we substitute this delta function into Equation (12.10), the integration over
$E_f$ picks out the value $E_f = E_i$. This makes sense on physical grounds; in the
limit where perturbation theory is applicable, scattering off of the potential should
not change the energy of the incident particle by very much, which implies that
$E_f \approx E_i$. After performing this integration and taking $E_i = \hbar^2 k_i^2/2m$, we obtain

$$dP(i \to \text{any } E_f) = \frac{m k_i t}{\pi\hbar^3 V}\left|\int_V d^3r\, V(\mathbf{r})\, e^{i(\mathbf{k}_i - \mathbf{k}_f) \cdot \mathbf{r}}\right|^2 \left(\frac{d\Omega_f}{4\pi}\right) \qquad (12.12)$$

with the added restriction that $|\mathbf{k}_f| = |\mathbf{k}_i|$, since the delta function integration gave
$E_f = E_i$.

There remains one final task: determining the relationship between the transition
probability $dP$ in Equation (12.12) and the differential cross section $d\sigma/d\Omega$. At the
beginning of this chapter, we derived the relationship between the total scattering
cross section and the scattering probability in terms of the total distance $L$ travelled
by the incident particle (Equation 12.1). If the incident particle has velocity $v_i$ and
travels for a time $t$, we can substitute $L = v_i t$ into Equation (12.1) to obtain an
alternative relation between scattering probability and cross section

$$P = n v_i t \sigma$$

Substituting the expression for $dP$ given by Equation (12.12) into Equation
(12.13), using $v_i = p_i/m = \hbar k_i/m$ and assuming one target per volume $V$ so that
$n = 1/V$, we get a final expression for the differential cross section:

$$\frac{d\sigma}{d\Omega} = \left(\frac{m}{2\pi\hbar^2}\right)^2 \left|\int d^3r\, V(\mathbf{r})\, e^{i(\mathbf{k}_i - \mathbf{k}_f) \cdot \mathbf{r}}\right|^2 \qquad (12.14)$$

where we have now taken the limit where $V$ goes to infinity, so the integral is over
all space. Equation (12.14) is called the Born approximation. Further, when this
approximation is valid, it will always be true that the incident wave vector $\mathbf{k}_i$ and
the scattered wave vector $\mathbf{k}_f$ satisfy $|\mathbf{k}_i| = |\mathbf{k}_f|$.

FIGURE 12.9 The momentum transfer $\mathbf{K} = \mathbf{k}_f - \mathbf{k}_i$ gives the change in momentum $\hbar\mathbf{K}$
of the particle as it scatters. This figure shows that $K = 2k \sin(\theta/2)$.

The Born approximation can be expressed in a more compact form by defining
a new wave vector $\mathbf{K}$ (Figure 12.9):

$$\mathbf{K} = \mathbf{k}_f - \mathbf{k}_i$$

where $\mathbf{K}$ is called the momentum transfer because $\hbar\mathbf{K}$ gives the change in the
momentum of the particle as it scatters. Taking $\theta$ to be the scattering angle, i.e.,
the angle between $\mathbf{k}_f$ and $\mathbf{k}_i$, and taking $k = |\mathbf{k}_i| = |\mathbf{k}_f|$, Figure 12.9 shows that

$$K = 2k \sin(\theta/2) \qquad (12.15)$$

In terms of the momentum transfer, the Born approximation is

$$\frac{d\sigma}{d\Omega} = \left(\frac{m}{2\pi\hbar^2}\right)^2 \left|\int d^3r\, V(\mathbf{r})\, e^{-i\mathbf{K} \cdot \mathbf{r}}\right|^2 \qquad (12.16)$$

When the scattering potential is spherically symmetric [$V(\mathbf{r}) = V(r)$], the expression
for the cross section can be further simplified. In this case we can expand
the Born approximation out in spherical coordinates and perform the integrals over
$\theta$ and $\phi$. Equation (12.16) becomes


$$\frac{d\sigma}{d\Omega} = \left(\frac{m}{2\pi\hbar^2}\right)^2 \left|\int \sin\theta\, d\theta\, d\phi\, r^2\, dr\, V(r)\, e^{-i\mathbf{K} \cdot \mathbf{r}}\right|^2$$

The integral over $\phi$ just gives $2\pi$, and the integral over $\theta$ can be simplified by
choosing a coordinate system so that the polar axis points in the direction of $\mathbf{K}$; in
this case $\mathbf{K} \cdot \mathbf{r} = Kr \cos\theta$ (note that this is not the same $\theta$ that appears in Equation
(12.15) and Figure 12.9). This gives

$$\frac{d\sigma}{d\Omega} = \left(\frac{m}{2\pi\hbar^2}\right)^2 \left|\int \sin\theta\, d\theta\, d\phi\, r^2\, dr\, V(r)\, e^{-iKr\cos\theta}\right|^2$$

Integrating over both $\phi$ and $\theta$, the expression for the cross section simplifies to

$$\frac{d\sigma}{d\Omega} = \frac{4m^2}{\hbar^4 K^2}\left|\int_0^\infty r\, V(r)\, \sin(Kr)\, dr\right|^2 \qquad (12.17)$$

Equation (12.17) is the form of the Born approximation applicable to any
spherically-symmetric potential.

FIGURE 12.10 A repulsive spherical well has $V = V_0$ inside a sphere of radius $R$, and
$V = 0$ outside of this sphere.


Example 12.2. Scattering from a Three-Dimensional Repulsive 
Spherical Well. 

A particle scatters from the three-dimensional repulsive spherical well of radius R 
given by (Figure 12.10): 

V(r) = Vo, r < R 
V(r) = 0, r > R 

The energy of the particle is much greater than Vo- Use the Bom approximation to 
calculate the differential cross section. 



12.2 The Born Approximation 


269 


This is a spherically-symmetric potential, so we can use the form of the Born
approximation given by Equation (12.17). This gives

$$\frac{d\sigma}{d\Omega} = \frac{4m^2}{\hbar^4 K^2} \left| \int_0^R \sin(Kr) \, V_0 \, r \, dr \right|^2$$

Integrating over r,

$$\frac{d\sigma}{d\Omega} = \frac{4m^2 V_0^2}{\hbar^4 K^2} \left( \frac{\sin(KR)}{K^2} - \frac{R\cos(KR)}{K} \right)^2$$

Normally, $d\sigma/d\Omega$ is expressed in terms of the scattering angle $\theta$; recall that the
magnitude of the momentum transfer K and the scattering angle $\theta$ are related
through $K = 2k\sin(\theta/2)$, where k is the magnitude of both the incident and scattered
wave vector. Using this expression for K,

$$\frac{d\sigma}{d\Omega} = \frac{4m^2 V_0^2 R^6}{\hbar^4} \left( \frac{\sin[2kR\sin(\theta/2)] - 2kR\sin(\theta/2)\cos[2kR\sin(\theta/2)]}{[2kR\sin(\theta/2)]^3} \right)^2$$

A graph of the differential cross section as a function of $kR\sin(\theta/2)$ is given in
Figure 12.11. The differential cross section has a large central peak at small values
of $kR\sin(\theta/2)$ with tiny oscillations (barely visible on the scale of this figure) at
larger values.
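The height of the central peak and the smallness of the oscillations can be read off from the result of this example. As a short worked check, write $x = 2kR\sin(\theta/2)$ (the symbol $x$ is introduced here only for convenience) and expand for small $x$:

$$\sin x - x\cos x = \frac{x^3}{3} - \frac{x^5}{30} + \cdots \quad\Rightarrow\quad \frac{\sin x - x\cos x}{x^3} \to \frac{1}{3} \ \text{ as } x \to 0$$

so that the forward value is

$$\left.\frac{d\sigma}{d\Omega}\right|_{x \to 0} = \frac{4m^2 V_0^2 R^6}{9\hbar^4}$$

For $x \gg 1$ the bracketed ratio is dominated by $-\cos x / x^2$, so the cross section falls off as $1/x^4$ relative to the $R^6$ prefactor, which is why the oscillations are barely visible next to the central peak.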


Here is another example of the Born approximation, this time with a potential 
that is not spherically symmetric. 


Example 12.3. Scattering from a Delta-Function Potential. 

(a) A particle of mass m scatters off of a delta-function potential at the origin:

$$V(x, y, z) = \frac{\alpha\hbar^2}{m} \delta(x)\delta(y)\delta(z)$$

(Here $\alpha$ is a constant with units of length, and the constant in front of the delta
functions ensures that V has units of energy.) Use the Born approximation to
calculate $d\sigma/d\Omega$.

(b) Repeat this calculation with the delta-function potential located at the point
(b, 0, 0), so

$$V(x, y, z) = \frac{\alpha\hbar^2}{m} \delta(x - b)\delta(y)\delta(z)$$

(c) Now suppose that there are delta-function potentials at both (0, 0, 0) and
(b, 0, 0), so that

$$V(x, y, z) = \frac{\alpha\hbar^2}{m} [\delta(x - b)\delta(y)\delta(z) + \delta(x)\delta(y)\delta(z)]$$










FIGURE 12.11 The differential cross section $d\sigma/d\Omega$ as a function of $kR\sin(\theta/2)$ for a
repulsive spherical square-well potential of radius R, where k is the wave number of the
incident particle and $\theta$ is the scattering angle.


Calculate $d\sigma/d\Omega$ in this case, and compare it to the sum of the cross sections in
(a) and (b).

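(a) Since we are dealing with potentials which are not symmetric about the
origin, we use the full Born approximation, Equation (12.16):

$$\frac{d\sigma}{d\Omega} = \left(\frac{m}{2\pi\hbar^2}\right)^2 \left| \int d^3r \, V(\mathbf{r}) \, e^{-i\mathbf{K}\cdot\mathbf{r}} \right|^2$$

Substituting the first delta-function potential from part (a) gives

$$\frac{d\sigma}{d\Omega} = \left(\frac{m}{2\pi\hbar^2}\right)^2 \left| \int dx \, dy \, dz \, \frac{\alpha\hbar^2}{m} \delta(x)\delta(y)\delta(z) \, e^{-i(K_x x + K_y y + K_z z)} \right|^2$$

The integral over the delta function picks out the value x = 0, y = 0, z = 0 in the
exponential, giving $e^0 = 1$, and the cross section reduces to

$$\frac{d\sigma}{d\Omega} = \frac{\alpha^2}{4\pi^2}$$

Note that the cross section has units of area, as expected.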





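(b) For the delta function at (b, 0, 0), we have

$$\frac{d\sigma}{d\Omega} = \left(\frac{m}{2\pi\hbar^2}\right)^2 \left| \int dx \, dy \, dz \, \frac{\alpha\hbar^2}{m} \delta(x - b)\delta(y)\delta(z) \, e^{-i(K_x x + K_y y + K_z z)} \right|^2$$

$$= \frac{\alpha^2}{4\pi^2} \left| e^{-iK_x b} \right|^2$$

$$= \frac{\alpha^2}{4\pi^2}$$

i.e., the cross section is unchanged when the delta-function potential is moved to
a different position.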

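(c) Now consider the case with both delta functions together:

$$\frac{d\sigma}{d\Omega} = \left(\frac{m}{2\pi\hbar^2}\right)^2 \left| \int dx \, dy \, dz \, \frac{\alpha\hbar^2}{m} [\delta(x - b)\delta(y)\delta(z) + \delta(x)\delta(y)\delta(z)] \, e^{-i(K_x x + K_y y + K_z z)} \right|^2$$

$$= \frac{\alpha^2}{4\pi^2} \left| e^{-iK_x b} + e^{-i(0)} \right|^2$$

Using the identity

$$|1 + e^{ix}|^2 = 4\cos^2(x/2)$$

the cross section reduces to

$$\frac{d\sigma}{d\Omega} = \frac{\alpha^2}{\pi^2} \cos^2\left(\frac{K_x b}{2}\right)$$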



Now we see an interesting result: the cross section calculated in part (c) is not 
the sum of the cross sections in (a) and (b). The reason is the wave nature of the 
scattering particle: just as for optical scattering, there is interference between the 
waves scattering off of the two delta functions, so the individual cross sections 
from the two delta functions cannot simply be added together. 
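The interference in part (c) can be checked numerically for any arrangement of equal-strength delta functions, because each one contributes a pure phase to the Born integral. The sketch below assumes the potential of Example 12.3, for which the prefactor $(m/2\pi\hbar^2)(\alpha\hbar^2/m)$ reduces to $\alpha/2\pi$; the function name `born_delta_sites` and the sample values of K and b are hypothetical.

```python
import cmath
import math

def born_delta_sites(sites, K, alpha=1.0):
    """Born d(sigma)/d(Omega) for delta functions of strength alpha*hbar^2/m at `sites`.

    sites: list of (x, y, z) positions; K: momentum-transfer vector (Kx, Ky, Kz).
    Each delta function contributes exp(-i K.r) to the amplitude.
    """
    amplitude = sum(
        cmath.exp(-1j * (K[0] * x + K[1] * y + K[2] * z)) for x, y, z in sites
    )
    return (alpha / (2.0 * math.pi)) ** 2 * abs(amplitude) ** 2

K = (1.3, 0.2, -0.7)   # arbitrary sample momentum transfer
b = 2.0                # arbitrary sample separation
single_a = born_delta_sites([(0.0, 0.0, 0.0)], K)
single_b = born_delta_sites([(b, 0.0, 0.0)], K)
both = born_delta_sites([(0.0, 0.0, 0.0), (b, 0.0, 0.0)], K)
```

Here `single_a` and `single_b` are equal, while `both` follows the $\cos^2(K_x b/2)$ form rather than `single_a + single_b`.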


As noted several times, the Born approximation is valid when the potential can 
be treated as a small perturbation, i.e., when the energy of the incident particle 
is much greater than V. In the next section we consider the opposite limit of 
low-energy scattering. 


12.3 ■ PARTIAL WAVES 

Although the method of partial waves is simplest when the energy of the incident 
particle is low, we do not need to introduce this assumption immediately. Suppose 
we have an incident particle moving in the z direction, and it scatters off of a 






spherically-symmetric potential V(r) centered at the origin. As in the previous
section, we will assume that $V(r) \to 0$ as $r \to \infty$. The time-independent wave
function for the incident particle far from the origin is

$$\psi_i = e^{ikz}$$

but what about the wave function $\psi_f$ for the scattered particle? In general, the
scattered particle is given by a wave expanding radially outward from the scattering
potential with an amplitude that depends on the angular direction.

To construct the wave function corresponding to such a wave, consider first the 
case of perfect spherical symmetry, i.e., a wave expanding radially outward with 
equal amplitude in all angular directions. This wave represents a solution to the 
radial Schrodinger equation (Equation 6.39): 


$$-\frac{\hbar^2}{2m} \frac{d^2}{dr^2}(rR(r)) + \frac{\hbar^2 l(l+1)}{2mr^2} rR(r) + V(r) \, rR(r) = E \, rR(r)$$


Since we are interested in a freely-expanding wave, we take V = 0, and the
condition that the wave be spherically symmetric implies that l = 0. Further, we
would like to express the particle momentum in terms of k, rather than E, where
$E = \hbar^2 k^2 / 2m$. Then the radial Schrödinger equation becomes

$$\frac{d^2}{dr^2}(rR) + k^2 (rR) = 0$$


which has the general solution 


$$R = A\frac{e^{ikr}}{r} + B\frac{e^{-ikr}}{r}$$

where A and B are constants to be determined. The first term represents a radially 
outgoing wave, while the second term represents a radially incoming wave. Thus, 
for a scattered particle, the second term makes no sense on physical grounds, so 
B must be zero, giving 

$$R = A\frac{e^{ikr}}{r} \qquad (12.18)$$

Equation (12.18) represents the radial equivalent of a plane wave. Just as $e^{ikz}$ gives
a wave moving in the z direction, the quantity $e^{ikr}/r$ represents a spherically-
symmetric wave moving radially outward (Figure 12.12).

Now, if a particle scatters from the potential, the outgoing wave need no longer
be isotropic. Since we have assumed a spherically-symmetric potential, the amplitude
of the scattered wave can be a function of $\theta$, but it should be independent
of $\phi$. The most general form we can write for a radially-expanding wave with a
dependence on $\theta$ is

$$\psi_f = f(\theta)\frac{e^{ikr}}{r}$$



FIGURE 12.12 The wave function $\psi = e^{ikz}$ represents a wave moving in the z direction;
$\psi = e^{ikr}/r$ represents a spherically-symmetric wave moving radially outward.


and the total wave function far from the origin, including both the incident and 
scattered particle, will be 


$$\psi_T = \psi_i + \psi_f = e^{ikz} + f(\theta)\frac{e^{ikr}}{r} \qquad (12.19)$$

The cross section is a function of $f(\theta)$. To determine this function, we express
the cross section in terms of scattered energy rather than discrete particles:

$$\frac{d\sigma}{d\Omega} = \frac{\text{scattered energy/solid angle}}{\text{incident energy/area}} = \frac{r^2 |\psi_f|^2}{|\psi_i|^2}$$

Substituting our expressions for $\psi_i$ and $\psi_f$ gives

$$\frac{d\sigma}{d\Omega} = |f(\theta)|^2$$

So the problem of determining the differential cross section reduces to solving the
Schrödinger equation in order to find $f(\theta)$.

We will derive such a solution in the region far from the potential, so that V = 0.
The radial Schrödinger equation for V = 0 can be written as

$$\frac{d^2}{dr^2}(rR(r)) - \frac{l(l+1)}{r} R(r) + k^2 (rR(r)) = 0$$


This equation can be solved exactly for any value of l; the solutions for R(r) are
called spherical Bessel functions and are written as $j_l(kr)$. These solutions, for the
first few values of l, are

$$l = 0: \quad R = j_0(kr) = \frac{\sin(kr)}{kr}$$

$$l = 1: \quad R = j_1(kr) = \frac{\sin(kr)}{(kr)^2} - \frac{\cos(kr)}{kr}$$

$$l = 2: \quad R = j_2(kr) = \left[\frac{3}{(kr)^3} - \frac{1}{kr}\right]\sin(kr) - \frac{3\cos(kr)}{(kr)^2}$$

As usual, the general solution to the Schrodinger equation is the product of this 
radial solution and the appropriate spherical harmonic: 


$$\psi(r, \theta, \phi) = j_l(kr) \, Y_l^m(\theta, \phi) \qquad (12.20)$$

However, this result leads to a puzzle. We already have a set of solutions to the 
Schrodinger equation with V = 0, namely, 

$$\psi = e^{i\mathbf{k}\cdot\mathbf{r}} \qquad (12.21)$$

Thus, it would appear that there are two different sets of solutions to the three- 
dimensional Schrodinger equation for the case of V = 0. In fact, both sets of 
solutions [Equations (12.20)—(12.21)] are valid. They simply represent solutions 
in spherical and rectangular coordinate systems, respectively. Furthermore, either 
set of solutions can be used as a basis set. This means, for example, that an incoming 
wave in the z direction can be written as a sum of spherical waves: 

$$e^{ikz} = \sum_{l,m} c_{lm} \, j_l(kr) \, Y_l^m(\theta, \phi) \qquad (12.22)$$

where the $c_{lm}$'s are the constants in the expansion.

Physically, Equation (12.22) corresponds to expressing a plane wave as the 
sum of wave functions with different angular momenta. This expansion is useful 
because we can now show that, in the low-energy limit, it can be a good approximation
to retain only the l = 0 term. To see the reason for this, consider a particle
scattering off of a potential with fixed radius $r_0$, so that V is negligible for $r > r_0$.
The spherical Bessel functions have the property that for $kr \ll l$,

$$j_l(kr) \propto (kr)^l$$

Thus, if k is sufficiently small that $kr_0 \ll 1$, all of the spherical Bessel functions
are negligible in the vicinity of the potential ($r < r_0$) except for l = 0. In this case
only the l = 0 component of the incident plane wave “feels” the potential.
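The suppression of the higher partial waves at small kr can be seen directly from the closed forms of the first three spherical Bessel functions. The sketch below is an illustration only; the helper names `j0`, `j1`, `j2` are chosen here, and the comparison values use the standard small-argument limits $j_1(x) \approx x/3$ and $j_2(x) \approx x^2/15$.

```python
import math

def j0(x):
    """Spherical Bessel function for l = 0."""
    return math.sin(x) / x

def j1(x):
    """Spherical Bessel function for l = 1."""
    return math.sin(x) / x**2 - math.cos(x) / x

def j2(x):
    """Spherical Bessel function for l = 2."""
    return (3.0 / x**3 - 1.0 / x) * math.sin(x) - 3.0 * math.cos(x) / x**2

x = 0.1  # a sample value of k*r well inside the low-energy regime
ratios = (j1(x) / j0(x), j2(x) / j0(x))  # size of l = 1, 2 relative to l = 0
```

At `x = 0.1` the l = 1 term is a few percent of the l = 0 term and the l = 2 term is smaller again by roughly another factor of x.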

This argument has a simple classical analog. Consider a particle with momentum
p scattering off of a potential that vanishes for $r > r_0$ (Figure 12.13). Let s be the
closest distance that the particle attains relative to the center of the potential. Since
the potential is negligible for distances greater than $r_0$, scattering can occur only







FIGURE 12.13 A particle with momentum p scatters off of a potential V of radius $r_0$.
The potential will affect the particle only if the point of closest approach s is smaller
than $r_0$.


if $s < r_0$. But this immediately tells us something about the angular momentum
of the particle relative to the origin. The angular momentum is L = ps, and the
requirement that $s < r_0$ translates into the relation

$$L < p r_0$$

Thus, in the classical case, low-energy scattering also translates into low angular 
momentum scattering. 

We can assume then that in the low-energy limit, only the l = 0 component of
the incoming wave is scattered and the higher l waves are unaffected. Therefore,
this approximation is called s-wave scattering where s indicates that l = 0. In this
limit we write the sum in Equation (12.22) as

$$e^{ikz} = c_{00} \, j_0(kr) \, Y_0^0(\theta, \phi) + \sum_{l>0} c_{lm} \, j_l(kr) \, Y_l^m(\theta, \phi) \qquad (12.23)$$

To find $c_{00}$ we multiply both sides of Equation (12.23) by $Y_0^{0*}$, set $z = r\cos\theta$ on
the left-hand side, and integrate over $\theta$ and $\phi$. Because of the orthogonality of the
spherical harmonics, only the first term on the right-hand side contributes, and we
get

$$\int e^{ikr\cos\theta} \, Y_0^{0*} \sin\theta \, d\theta \, d\phi = \int c_{00} \, j_0(kr) \, |Y_0^0|^2 \sin\theta \, d\theta \, d\phi$$

Taking $Y_0^0 = 1/\sqrt{4\pi}$ and $j_0(kr) = \sin(kr)/kr$ and performing the integrations
gives

$$\frac{1}{\sqrt{4\pi}} \, \frac{2\pi}{ikr} \left( e^{ikr} - e^{-ikr} \right) = c_{00} \frac{\sin(kr)}{kr} \qquad (12.24)$$







Now we recall that $\sin(x) = (e^{ix} - e^{-ix})/2i$, so Equation (12.24) reduces to

$$c_{00} = \sqrt{4\pi}$$


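Using this value for $c_{00}$ and writing $j_0 = \sin(kr)/kr$ in terms of complex exponentials,
we can express Equation (12.23) as

$$e^{ikz} = \frac{1}{2i} \left( \frac{e^{ikr}}{kr} - \frac{e^{-ikr}}{kr} \right) + (l > 0 \text{ terms})$$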


Note that this represents the incident wave $\psi_i$. What does the scattered wave
look like? Since we have assumed that only the l = 0 part of the wave is actually
affected by the potential, the only contribution of the scattering will be an outgoing
spherical wave with l = 0. Thus, the total wave function $\psi_T = \psi_i + \psi_f$ will be

$$\psi_T = \frac{1}{2i} \left( \eta_0 \frac{e^{ikr}}{kr} - \frac{e^{-ikr}}{kr} \right) \qquad (12.25)$$


where we have dropped the l > 0 terms, and the effect of the scattering has been
to add a term proportional to $e^{ikr}/kr$. We do not yet know the amplitude of this
additional term, so we have absorbed it into a new unknown complex number $\eta_0$,
which we need to calculate.

Conservation of particle probability means that $|\eta_0|^2 = 1$, so it is conventional
to re-express $\eta_0$ in terms of a function with unit magnitude

$$\eta_0 = e^{2i\delta_0} \qquad (12.26)$$


where $\delta_0$, defined by Equation (12.26), is called the s-wave phase shift. In order to
calculate a cross section, we need to write $\psi_T$ in Equation (12.25) in the form of an
incident plane wave and a scattered spherical wave (as in Equation 12.19). Pulling
out the terms that correspond to the incident plane wave $e^{ikz}$ and expressing $\eta_0$ in
terms of $\delta_0$, we get

$$\psi_T = e^{ikz} + \left( \frac{e^{2i\delta_0} - 1}{2ik} \right) \frac{e^{ikr}}{r}$$


Then the cross section is

$$\frac{d\sigma}{d\Omega} = \left| \frac{e^{2i\delta_0} - 1}{2ik} \right|^2 = \frac{\sin^2\delta_0}{k^2}$$








Note that the differential cross section in this case is completely isotropic, i.e.,
independent of the scattering angle. This is because of our assumption of s-wave
scattering; in this limit only the l = 0 part of the wave undergoes scattering, and
the l = 0 wave is isotropic. The total cross section is then just

$$\sigma = \frac{4\pi \sin^2\delta_0}{k^2} \qquad (12.27)$$

Of course, the problem now is to find the phase shift $\delta_0$; this calculation is performed
by solving the Schrödinger equation for the scattering potential, as shown here.


Example 12.4. Low-Energy Scattering from an Infinitely Hard Sphere. 

A particle is incident on a central potential V(r) which is infinitely high at r < a
with V = 0 for r > a. The energy of the particle is sufficiently low that s-wave
scattering is a good approximation. Find the total cross section.

The wave function outside the potential is given by Equation (12.25):

$$\psi_T = \frac{1}{2i} \left( \eta_0 \frac{e^{ikr}}{kr} - \frac{e^{-ikr}}{kr} \right)$$

$$= \frac{1}{2i} \left( \frac{e^{ikr + 2i\delta_0}}{kr} - \frac{e^{-ikr}}{kr} \right)$$

At the surface of the potential, r = a, the infinite potential forces the wave function
to zero, so $\psi_T(r = a) = 0$. This means that

$$e^{ika + 2i\delta_0} - e^{-ika} = 0$$

which has the solution $\delta_0 = -ka$. Then the s-wave expression for the cross section,
Equation (12.27), gives

$$\sigma = 4\pi \frac{\sin^2 ka}{k^2}$$

In the low-energy limit, $k \to 0$, and we get

$$\sigma = 4\pi a^2$$

Note that this is four times the classical scattering cross section from a hard sphere
(Example 12.1). In the classical case, the incident particle “sees” the geometrical
cross-sectional area of the sphere, which is just $\pi a^2$. In the quantum mechanical
system, the incident particle acts like a wave and diffracts around the target, giving
a larger cross section.
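The approach to the low-energy limit can be checked numerically. This is a minimal sketch using only the results of this example ($\delta_0 = -ka$ and Equation 12.27); the function name `hard_sphere_sigma` and the sample values are hypothetical.

```python
import math

def hard_sphere_sigma(k, a):
    """s-wave total cross section for an infinitely hard sphere of radius a."""
    delta0 = -k * a                                       # phase shift from Example 12.4
    return 4.0 * math.pi * math.sin(delta0) ** 2 / k**2   # Equation (12.27)

a = 2.0
sigma_low = hard_sphere_sigma(1e-4, a)            # k*a << 1
ratio_to_classical = sigma_low / (math.pi * a**2)  # compare with pi a^2
```

As k decreases, `ratio_to_classical` approaches 4, in agreement with $\sigma = 4\pi a^2$.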




The idea of s-wave scattering can be extended by summing over all of the l's in
the spherical wave expansion and finding the phase shift $\delta_l$ for each partial wave.
The result is a total cross section which looks like

$$\sigma = \frac{4\pi}{k^2} \sum_{l=0}^{\infty} (2l + 1) \sin^2\delta_l$$

The case of s-wave scattering, which is the only one we have examined in detail,
can be obtained from this result by taking only the l = 0 term in the series.


EXERCISES 

12.1 Show that in the Born approximation, the differential cross section obtained from the
negative of a given potential −V(r) is exactly the same as that obtained from the
potential V(r).

12.2 An incident particle with mass m, velocity v, and charge ze scatters off of a charge
Ze at the origin. Use the Born approximation to calculate the differential scattering
cross section for the screened Coulomb potential

$$V(r) = \frac{zZe^2}{4\pi\epsilon_0 r} e^{-r/d}$$

Then let $d \to \infty$, so that V(r) approaches the normal Coulomb potential, and show
that $d\sigma/d\Omega$ approaches the Rutherford scattering differential cross section

$$\frac{d\sigma}{d\Omega} = \left(\frac{1}{4\pi\epsilon_0}\right)^2 \left(\frac{zZe^2}{2mv^2}\right)^2 \frac{1}{\sin^4(\theta/2)}$$

12.3 (a) A particle with charge +e is incident on an electric dipole consisting of a charge
of +e and a charge of −e separated by the vector d (which runs from −e to +e).
The energy of the incident particle is sufficiently large to treat the dipole as a
small perturbation. Calculate the differential scattering cross section $d\sigma/d\Omega$ as
a function of the initial wave vector $\mathbf{k}_i$, the scattered wave vector $\mathbf{k}_f$, and the
standard Rutherford scattering cross section $(d\sigma/d\Omega)_R$, given by

$$\left(\frac{d\sigma}{d\Omega}\right)_R = \left| \frac{m}{2\pi\hbar^2} \int d^3r \, V_C(r) \, e^{-i\mathbf{K}\cdot\mathbf{r}} \right|^2$$

where $\mathbf{K} = \mathbf{k}_f - \mathbf{k}_i$ and $V_C(r)$ is the Coulomb potential.

(b) In the limits $k_i d \ll 1$ and $k_i d \gg 1$, determine whether the dipole differential cross
section is larger or smaller than the Rutherford differential cross section. Explain
the physical reason for these results.

12.4 A particle which is travelling in the +z direction scatters off of a potential consisting
of four delta functions at the vertices of a square in the x-y plane at the points
(−a, 0, 0), (+a, 0, 0), (0, −a, 0), and (0, +a, 0).

The potential is

$$V = A\delta(x + a)\delta(y)\delta(z) + A\delta(x - a)\delta(y)\delta(z) + A\delta(x)\delta(y + a)\delta(z) + A\delta(x)\delta(y - a)\delta(z)$$

where A is a constant. Use the Born approximation to calculate the differential scattering
cross section. Express the answer in terms of the magnitude of the incident
wave vector k and the scattering angles $\theta$ and $\phi$. (This is an example where the cross
section does depend on $\phi$.)

12.5 (a) A particle of mass m and energy E scatters off of the central potential
$V(r) = Ar^{-2}$, where A is a constant. Use the Born approximation to calculate the
differential cross section $d\sigma/d\Omega$ as a function of E and the scattering angle $\theta$.

(b) Show that the total cross section $\sigma$ is infinite.

12.6 A particle of mass m is incident on the potential $V(r) = V_0 e^{-r/r_0}$, where $V_0$ and $r_0$ are
constants with units of energy and length, respectively. The potential is independent
of $\theta$ and $\phi$. The energy of the particle is large, so that the potential can be treated
as a small perturbation. Calculate the differential scattering cross section $d\sigma/d\Omega$.
Express the final answer as a function of the scattering angle $\theta$ and the energy of the
particle E.

12.7 (a) A particle with mass m scatters off of the potential

$$V = A\delta(z), \quad \text{for } -a < x < a \text{ and } -a < y < a$$
$$V = 0, \quad \text{otherwise}$$

where A is a constant. In other words, this potential forms a square in the x-y
plane and is infinitesimally thin in the z direction. Use the Born approximation to
calculate the differential scattering cross section. (Assume an arbitrary direction
for the incident particle, and express the answer in terms of the momentum transfer
vector $\mathbf{K} = \mathbf{k}_f - \mathbf{k}_i$.)

(b) Show that in the limit where $a \to \infty$ (so that the scattering potential occupies
the entire x-y plane), the resulting differential cross section corresponds to only
two possible results: either the particle will pass through without any scattering
or else it will scatter with the angle of incidence equal to the angle of reflection.

12.8 Suppose a particle with mass m scatters off of a finite spherical potential of radius $R_0$
given by

$$V(r) = V_0 \quad (r < R_0)$$
$$V(r) = 0 \quad (r > R_0)$$

In the limit where the energy of the incident particle is small, show that the total
cross section is

$$\sigma = 4\pi R_0^2 \left( \frac{\tanh(\kappa R_0)}{\kappa R_0} - 1 \right)^2$$

where $\kappa = \sqrt{2m(V_0 - E)/\hbar^2}$.



CHAPTER

13


The Multiparticle Schrödinger
Equation


Thus far, we have examined the solution of the Schrödinger equation for a single
particle. However, the real world consists of systems composed of many particles.
While it is straightforward to generalize the Schrödinger equation to systems of
many particles, something very interesting happens when dealing with systems
of identical particles: the requirement that two particles be treated as identical
imposes certain restrictions on the properties of the wave function.

When dealing with systems containing multiple particles, we will not write
down a separate wave function for each particle. Instead, we will have a single
wave function that encodes the information about all of the particles together.
We will show that for systems of identical particles, the wave function is either
unchanged or multiplied by −1 when any two of the particles are exchanged. This,
in turn, leads to an important result called the Pauli Exclusion Principle, which
turns out to be crucial to the very existence of matter as we know it.


13.1 ■ WAVE FUNCTION FOR IDENTICAL PARTICLES

For a single particle with definite energy E, the Schrödinger equation has the
familiar form

$$-\frac{\hbar^2}{2m} \nabla^2 \psi(\mathbf{r}) + V(\mathbf{r})\psi(\mathbf{r}) = E\psi(\mathbf{r})$$

Now suppose that we have two particles, not necessarily identical. In order to
treat these two particles as a single system, we must have a single wave function
which combines the information for both particles (i.e., instead of separate wave
functions for each particle). This wave function is written as $\psi(\mathbf{r}_1, \mathbf{r}_2)$. The physical
interpretation of this wave function in terms of probabilities is similar to the
one-particle wave function. Recall that for a single particle, $|\psi(\mathbf{r})|^2 \, d^3r$ gives
the probability that the particle can be found in a small volume $d^3r$ near $\mathbf{r}$. For
the two-particle wave function, $|\psi(\mathbf{r}_1, \mathbf{r}_2)|^2 \, d^3r_1 \, d^3r_2$ gives the probability that
particle 1 is located in the small volume $d^3r_1$ near $\mathbf{r}_1$ and particle 2 is located in
the small volume $d^3r_2$ near $\mathbf{r}_2$.




Example 13.1. Interpretation of the Two-Particle Wave Function. 

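Two particles are confined in an infinite one-dimensional square well with width
a. They are in a state with a wave function given by

$$\psi(x_1, x_2) = \frac{2}{a} \sin\left(\frac{\pi x_1}{a}\right) \sin\left(\frac{\pi x_2}{a}\right)$$

A measurement is made of the positions of both particles. Calculate the probability
that particle 1 is on the left-hand side of the potential well (x < a/2) and particle
2 is on the right-hand side (x > a/2).

This probability is

$$P = \int_{x_2 = a/2}^{a} \int_{x_1 = 0}^{a/2} |\psi(x_1, x_2)|^2 \, dx_1 \, dx_2$$

$$= \int_{x_2 = a/2}^{a} \int_{x_1 = 0}^{a/2} \left(\frac{2}{a}\right)^2 \sin^2\left(\frac{\pi x_1}{a}\right) \sin^2\left(\frac{\pi x_2}{a}\right) dx_1 \, dx_2$$

$$= \frac{1}{4}$$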
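The double integral in this example can also be estimated numerically, which is a useful cross-check when the wave function does not factor so conveniently. The sketch below uses a simple midpoint rule; the function name `prob_left_right` and the grid size are assumptions made for illustration.

```python
import math

def prob_left_right(a=1.0, n=400):
    """Midpoint-rule estimate of P(x1 < a/2 and x2 > a/2) for
    psi(x1, x2) = (2/a) sin(pi x1 / a) sin(pi x2 / a)."""
    h = (a / 2.0) / n          # cell width in each coordinate
    total = 0.0
    for i in range(n):
        x1 = (i + 0.5) * h                 # particle 1 in the left half
        for j in range(n):
            x2 = a / 2.0 + (j + 0.5) * h   # particle 2 in the right half
            psi = (2.0 / a) * math.sin(math.pi * x1 / a) * math.sin(math.pi * x2 / a)
            total += psi**2 * h * h
    return total

p = prob_left_right()
```

The result is independent of the well width a, since the probability is dimensionless.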



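The two-particle wave function obeys the Schrödinger equation in
the form

$$-\frac{\hbar^2}{2m} \nabla_1^2 \psi(\mathbf{r}_1, \mathbf{r}_2) - \frac{\hbar^2}{2m} \nabla_2^2 \psi(\mathbf{r}_1, \mathbf{r}_2) + V(\mathbf{r}_1, \mathbf{r}_2)\psi(\mathbf{r}_1, \mathbf{r}_2) = E\psi(\mathbf{r}_1, \mathbf{r}_2) \qquad (13.1)$$

where the symbol $\nabla_1^2$ is the sum of second derivatives taken with respect to $\mathbf{r}_1$:

$$\nabla_1^2 = \frac{\partial^2}{\partial x_1^2} + \frac{\partial^2}{\partial y_1^2} + \frac{\partial^2}{\partial z_1^2}$$

and, similarly, for $\nabla_2^2$ the derivatives are taken with respect to $\mathbf{r}_2$:

$$\nabla_2^2 = \frac{\partial^2}{\partial x_2^2} + \frac{\partial^2}{\partial y_2^2} + \frac{\partial^2}{\partial z_2^2}$$

and the potential V is now a function of the positions of both particles. So far, this is
a straightforward generalization of the one-particle wave function and one-particle
Schrödinger equation. Now, however, we introduce a twist. Subatomic particles
have the property that any two of the same kind of particle (two protons, two
electrons, etc.) are indistinguishable. This means that they have exactly the same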




FIGURE 13.1 Electron 1 and electron 2 are indistinguishable. It is therefore impossible
to determine whether or not they have been interchanged.


mass, the same spin, etc. In practical terms, consider the following experiment: 
two electrons are placed at two different locations in a laboratory (Figure 13.1). 
You turn your back, and the laboratory assistant either leaves the electrons in place 
or interchanges the two particles. When you again observe the electrons, there is 
literally no way to determine whether or not the electrons have been interchanged, 
since they are absolutely identical. This is a practical definition of indistinguishable 
particles. This property of indistinguishability has profound consequences. 

To determine these consequences, define a new operator called the exchange
operator, $\mathcal{E}_{12}$, which has the effect of interchanging particle 1 and particle 2. Thus,
the effect of the exchange operator on the wave function is

$$\mathcal{E}_{12} \psi(\mathbf{r}_1, \mathbf{r}_2) = \psi(\mathbf{r}_2, \mathbf{r}_1)$$

It is possible to show (Exercise 13.1) that the exchange operator commutes with
the Hamiltonian as long as the two-particle potential has the property that

$$V(\mathbf{r}_1, \mathbf{r}_2) = V(\mathbf{r}_2, \mathbf{r}_1) \qquad (13.2)$$

In fact, most reasonable potentials have this property; the most common situation 
is for two particles each to be subject to the same external potential, and to interact 
with each other via a potential that is symmetric under the interchange of the two 
particles. For example, each electron in the helium atom experiences the Coulomb 
potential of the nucleus, and the electrons also repel each other via a Coulomb 





potential, leading to the potential

$$V(\mathbf{r}_1, \mathbf{r}_2) = -\frac{2e^2}{4\pi\epsilon_0} \frac{1}{r_1} - \frac{2e^2}{4\pi\epsilon_0} \frac{1}{r_2} + \frac{e^2}{4\pi\epsilon_0} \frac{1}{|\mathbf{r}_1 - \mathbf{r}_2|} \qquad (13.3)$$


Clearly, this potential satisfies Equation (13.2). We will assume throughout the 
remainder of this chapter that we are dealing exclusively with Hamiltonians that 
commute with the exchange operator. 

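Therefore, we can take the two-particle wave function to be an eigenfunction
of $\mathcal{E}_{12}$ and attempt to determine the possible eigenvalues. Let $\gamma$ be an eigenvalue
of $\mathcal{E}_{12}$:

$$\mathcal{E}_{12} \psi(\mathbf{r}_1, \mathbf{r}_2) = \gamma \psi(\mathbf{r}_1, \mathbf{r}_2)$$

Now note that for any wave function, applying $\mathcal{E}_{12}$ twice simply restores the original
wave function; the particles are interchanged, then interchanged back to their
original positions. Mathematically,

$$\mathcal{E}_{12}^2 \psi(\mathbf{r}_1, \mathbf{r}_2) = \mathcal{E}_{12} \psi(\mathbf{r}_2, \mathbf{r}_1) = \psi(\mathbf{r}_1, \mathbf{r}_2) \qquad (13.4)$$

But if $\psi(\mathbf{r}_1, \mathbf{r}_2)$ is an eigenfunction of $\mathcal{E}_{12}$ with eigenvalue $\gamma$, then applying $\mathcal{E}_{12}$
twice will pull out a factor of $\gamma^2$:

$$\mathcal{E}_{12}^2 \psi(\mathbf{r}_1, \mathbf{r}_2) = \gamma^2 \psi(\mathbf{r}_1, \mathbf{r}_2) \qquad (13.5)$$

Combining Equations (13.4) and (13.5) gives the possible values for $\gamma$:

$$\gamma^2 = 1$$

so

$$\gamma = \pm 1$$

Therefore, the exchange operator can produce only two possible results when
applied to the wave function. If $\gamma = 1$, then the eigenfunction equation gives
$\mathcal{E}_{12} \psi(\mathbf{r}_1, \mathbf{r}_2) = \psi(\mathbf{r}_1, \mathbf{r}_2)$, while the definition of $\mathcal{E}_{12}$ gives
$\mathcal{E}_{12} \psi(\mathbf{r}_1, \mathbf{r}_2) = \psi(\mathbf{r}_2, \mathbf{r}_1)$, so that

$$\psi(\mathbf{r}_1, \mathbf{r}_2) = \psi(\mathbf{r}_2, \mathbf{r}_1) \qquad (13.6)$$

and the wave function is symmetric under the interchange of the two particles.
Conversely, if $\gamma = -1$, then $\mathcal{E}_{12} \psi(\mathbf{r}_1, \mathbf{r}_2) = -\psi(\mathbf{r}_1, \mathbf{r}_2)$, and
$\mathcal{E}_{12} \psi(\mathbf{r}_1, \mathbf{r}_2) = \psi(\mathbf{r}_2, \mathbf{r}_1)$, so

$$\psi(\mathbf{r}_1, \mathbf{r}_2) = -\psi(\mathbf{r}_2, \mathbf{r}_1) \qquad (13.7)$$

and the wave function is antisymmetric under the interchange of the two particles.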







It is observed in nature that any given particle obeys either Equation (13.6) or 
Equation (13.7) but not both. Particles with wave functions that are symmetric 
under particle interchange are said to obey Bose-Einstein statistics and are called 
bosons, while particles with wave functions that are antisymmetric under parti¬ 
cle interchange obey Fermi-Dirac statistics and are called fermions. It is further 
observed that the category that a given particle belongs to is entirely determined 
by its spin. Particles with integer spin ( s = 0,1,2,...) behave as bosons, while 
particles with half-integer spin (s = 1/2, 3/2,...) behave as fermions. Hence, the 
electron, proton, and neutron, each with spin 1/2, are fermions, while the photon, 
with spin 1, is a boson. 

This result leads immediately to the Pauli exclusion principle, which states
that two fermions cannot occupy the same quantum state. For example, suppose
that the two electrons in the helium atom have all of the same quantum numbers,
and represent the two-particle wave function in Dirac notation as $|1\,2\rangle$. If the
two electrons have exactly the same quantum numbers, then the wave function is
invariant when the two electrons are exchanged, i.e., $|1\,2\rangle = |2\,1\rangle$. However, this
contradicts the fact that the wave function of two fermions must be antisymmetric
under their interchange: $|1\,2\rangle = -|2\,1\rangle$. Hence, two fermions cannot occupy
the exact same quantum state. If it were not for the exclusion principle, all of the
electrons in a multielectron atom would simply drop down into the ground state,
n = 1. It is the exclusion principle which forces the electrons into higher-energy
states, making atoms and chemistry as we know it possible. (This is discussed in
more detail in Section 13.2.)

More generally, the symmetry or antisymmetry of the wave function restricts the 
allowed solutions of the Schrodinger equation. As an example, consider a potential 
of the form 


$$V(\mathbf{r}_1, \mathbf{r}_2) = V_0(\mathbf{r}_1) + V_0(\mathbf{r}_2) + V_1(|\mathbf{r}_1 - \mathbf{r}_2|)$$

for which both particles experience the same external potential $V_0$ while interacting
with each other via the potential $V_1$. The Schrödinger equation is difficult to solve
with a general potential of this type, but it is instructive to take the limit where
$V_1 \ll V_0$, so that as a first approximation, the interaction potential can be ignored.
Even in this limit, the particles still affect each other through the requirement
that the wave function be either symmetric or antisymmetric. When $V_1 \ll V_0$, the
two-particle Schrödinger equation (Equation 13.1) takes the form

$$-\frac{\hbar^2}{2m} \nabla_1^2 \psi(\mathbf{r}_1, \mathbf{r}_2) + V(\mathbf{r}_1)\psi(\mathbf{r}_1, \mathbf{r}_2) - \frac{\hbar^2}{2m} \nabla_2^2 \psi(\mathbf{r}_1, \mathbf{r}_2) + V(\mathbf{r}_2)\psi(\mathbf{r}_1, \mathbf{r}_2) = E\psi(\mathbf{r}_1, \mathbf{r}_2) \qquad (13.8)$$

where we have dropped the “0” subscript on the potential for simplicity. Equation
(13.8) is applicable whenever two particles experience the same external potential
but do not interact directly with each other.




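This equation resembles the sum of two single-particle Schrödinger equations,
and this property can be exploited to find a solution of the two-particle equation.
Suppose that the single-particle Schrödinger equation, with potential V,

$$-\frac{\hbar^2}{2m} \nabla^2 \psi(\mathbf{r}) + V(\mathbf{r})\psi(\mathbf{r}) = E\psi(\mathbf{r})$$

can be solved exactly, yielding single-particle wave functions and corresponding
energies $\psi_n(\mathbf{r})$ and $E_n$, respectively. Then we can verify by direct substitution
into Equation (13.8) that the product of any two of these solutions for the two
particles, $\psi_m(\mathbf{r}_1)\psi_n(\mathbf{r}_2)$, is a solution of Equation (13.8):

$$-\frac{\hbar^2}{2m} \nabla_1^2 [\psi_m(\mathbf{r}_1)\psi_n(\mathbf{r}_2)] + V(\mathbf{r}_1)[\psi_m(\mathbf{r}_1)\psi_n(\mathbf{r}_2)] - \frac{\hbar^2}{2m} \nabla_2^2 [\psi_m(\mathbf{r}_1)\psi_n(\mathbf{r}_2)] + V(\mathbf{r}_2)[\psi_m(\mathbf{r}_1)\psi_n(\mathbf{r}_2)]$$

$$= \psi_n(\mathbf{r}_2) \left[ -\frac{\hbar^2}{2m} \nabla_1^2 \psi_m(\mathbf{r}_1) + V(\mathbf{r}_1)\psi_m(\mathbf{r}_1) \right] + \psi_m(\mathbf{r}_1) \left[ -\frac{\hbar^2}{2m} \nabla_2^2 \psi_n(\mathbf{r}_2) + V(\mathbf{r}_2)\psi_n(\mathbf{r}_2) \right]$$

$$= \psi_n(\mathbf{r}_2) E_m \psi_m(\mathbf{r}_1) + \psi_m(\mathbf{r}_1) E_n \psi_n(\mathbf{r}_2)$$

$$= (E_m + E_n) \psi_m(\mathbf{r}_1)\psi_n(\mathbf{r}_2)$$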

Thus, a general solution to Equation (13.8) is the wave function $\psi_{mn}(\mathbf{r}_1, \mathbf{r}_2)$ given
by

$$\psi_{mn}(\mathbf{r}_1, \mathbf{r}_2) = \psi_m(\mathbf{r}_1)\psi_n(\mathbf{r}_2) \qquad (13.9)$$

where $\psi_m$ and $\psi_n$ are the eigenfunctions of the one-particle Schrödinger equation
with potential V, and the energy corresponding to $\psi_{mn}$ is

$$E_{mn} = E_m + E_n$$

Although $\psi_{mn}$ satisfies the two-particle Schrödinger equation, this solution is neither
symmetric nor antisymmetric under the exchange of the two particles. The
way to resolve this problem is to note that there are actually two different solutions
which have the same energy $E_{mn}$; in addition to the solution in Equation (13.9),
there is another solution obtained by switching m and n:

$$\psi_{nm}(\mathbf{r}_1, \mathbf{r}_2) = \psi_n(\mathbf{r}_1)\psi_m(\mathbf{r}_2)$$

Since both of these wave functions correspond to the same energy, $E_m + E_n$, any
linear combination of them will also satisfy the Schrödinger equation and have
energy $E_m + E_n$. In particular, these two solutions can be combined to produce a




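wave function that is symmetric under interchange of the two particles:

$$\psi^S_{mn}(\mathbf{r}_1, \mathbf{r}_2) = \frac{1}{\sqrt{2}} [\psi_m(\mathbf{r}_1)\psi_n(\mathbf{r}_2) + \psi_n(\mathbf{r}_1)\psi_m(\mathbf{r}_2)] \qquad (13.10)$$

and a wave function that is antisymmetric under interchange of the two particles:

$$\psi^A_{mn}(\mathbf{r}_1, \mathbf{r}_2) = \frac{1}{\sqrt{2}} [\psi_m(\mathbf{r}_1)\psi_n(\mathbf{r}_2) - \psi_n(\mathbf{r}_1)\psi_m(\mathbf{r}_2)] \qquad (13.11)$$

where the $1/\sqrt{2}$ factor ensures that the two-particle wave functions are normalized
as long as the individual one-particle wave functions are correctly normalized.
These then are the symmetric and antisymmetric solutions to the two-particle
Schrödinger equation.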

One special case must be treated separately: the wave functions for which n =
m. For the case of antisymmetric wave functions, Equation (13.11) gives $\psi^A_{nn} = 0$,
so such wave functions are not allowed. (This is another corollary of the exclusion
principle.) Thus, if the allowed single-particle wave functions correspond to the
quantum number n, where n = 1, 2, 3, ..., then for antisymmetric wave functions,
the lowest-energy state is n = 1, m = 2. For symmetric wave functions, there is no
similar restriction; the state m = n gives a perfectly acceptable wave function, and
m = 1, n = 1 is the lowest-energy state. However, in this case the normalizing
factor $1/\sqrt{2}$, which is derived based on the assumption that $\psi_m(\mathbf{r}_1)\psi_n(\mathbf{r}_2)$ and
$\psi_n(\mathbf{r}_1)\psi_m(\mathbf{r}_2)$ are orthogonal wave functions, is no longer correct. Instead, the
normalized wave function is simply

$$\psi^S_{nn}(\mathbf{r}_1, \mathbf{r}_2) = \psi_n(\mathbf{r}_1)\psi_n(\mathbf{r}_2) \qquad (13.12)$$
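The symmetry and normalization statements above can be verified numerically for a concrete set of single-particle states. The sketch below uses the one-dimensional infinite square well eigenfunctions purely as an example basis; the function names `phi`, `psi_sym`, `psi_anti`, and `norm2` are hypothetical, and the norm is estimated with a midpoint rule.

```python
import math

def phi(n, x, a=1.0):
    """Normalized infinite-square-well eigenfunction (example basis only)."""
    return math.sqrt(2.0 / a) * math.sin(n * math.pi * x / a)

def psi_sym(m, n, x1, x2):
    """Symmetric combination with the 1/sqrt(2) factor, valid for m != n."""
    return (phi(m, x1) * phi(n, x2) + phi(n, x1) * phi(m, x2)) / math.sqrt(2.0)

def psi_anti(m, n, x1, x2):
    """Antisymmetric combination with the 1/sqrt(2) factor."""
    return (phi(m, x1) * phi(n, x2) - phi(n, x1) * phi(m, x2)) / math.sqrt(2.0)

def norm2(psi, m, n, a=1.0, N=200):
    """Midpoint-rule estimate of the integral of |psi|^2 over the well."""
    h = a / N
    return sum(
        psi(m, n, (i + 0.5) * h, (j + 0.5) * h) ** 2
        for i in range(N) for j in range(N)
    ) * h * h
```

With these definitions, `norm2(psi_sym, 1, 2)` and `norm2(psi_anti, 1, 2)` both come out close to 1, `psi_anti(2, 2, x1, x2)` vanishes identically, and `norm2(psi_sym, 1, 1)` comes out close to 2, which is why the $1/\sqrt{2}$ factor must be dropped when m = n.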

The spin states of the particles produce an additional complication, but before 
adding this complication, consider a solution for spinless particles. 

Example 13.2. Two Identical Spin-0 Particles in an Infinite One-Dimensional 
Square Well. 

Two identical spin-0 particles are confined in an infinite one-dimensional square 
well with width a. Find the energy levels and corresponding wave functions, and 
determine the ground state. 

The individual wave functions for the infinite one-dimensional square well are 

$$\psi_n(x) = \sqrt{\frac{2}{a}}\,\sin\left(\frac{n\pi x}{a}\right)$$

with energy

$$E_n = \frac{n^2\pi^2\hbar^2}{2ma^2}$$


Since the particles have spin 0, they are bosons, and their wave function must be 
symmetric as in Equations (13.10) and (13.12). Therefore, the total wave function 
is 

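$$\psi_{mn}(x_1, x_2) = \frac{1}{\sqrt{2}}\,\frac{2}{a}\left[\sin\left(\frac{m\pi x_1}{a}\right)\sin\left(\frac{n\pi x_2}{a}\right) + \sin\left(\frac{n\pi x_1}{a}\right)\sin\left(\frac{m\pi x_2}{a}\right)\right]$$

for $m \neq n$, and for $m = n$ the solution is

$$\psi_{nn}(x_1, x_2) = \frac{2}{a}\,\sin\left(\frac{n\pi x_1}{a}\right)\sin\left(\frac{n\pi x_2}{a}\right)$$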

with, in either case, a corresponding energy of 


$$E_{mn} = E_m + E_n = \frac{\pi^2\hbar^2}{2ma^2}\left[m^2 + n^2\right]$$


The ground-state energy is then 


$$E_{11} = \frac{\pi^2\hbar^2}{ma^2}$$


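and the ground-state wave function is

$$\psi_{11}(x_1, x_2) = \frac{2}{a}\,\sin\left(\frac{\pi x_1}{a}\right)\sin\left(\frac{\pi x_2}{a}\right)$$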



This was the wave function used in Example 13.1. 
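
The normalization factors in Equations (13.10) and (13.12) can be checked numerically. The following is a minimal sketch, assuming a well of unit width and a simple midpoint-rule integration; the function names (`psi`, `psi_sym`, `norm`, `energy`) are illustrative and do not come from the text.

```python
import numpy as np

a = 1.0  # well width (assumed unit value, for illustration only)

def psi(n, x):
    """Single-particle infinite-well eigenfunction sqrt(2/a) sin(n pi x / a)."""
    return np.sqrt(2.0 / a) * np.sin(n * np.pi * x / a)

def psi_sym(m, n, x1, x2):
    """Symmetric two-particle wave function, Equations (13.10) and (13.12)."""
    if m == n:
        return psi(n, x1) * psi(n, x2)
    return (psi(m, x1) * psi(n, x2) + psi(n, x1) * psi(m, x2)) / np.sqrt(2.0)

def norm(m, n, points=400):
    """Integrate |psi|^2 over 0 < x1, x2 < a with a midpoint rule."""
    dx = a / points
    x = (np.arange(points) + 0.5) * dx
    x1, x2 = np.meshgrid(x, x, indexing="ij")
    return float(np.sum(psi_sym(m, n, x1, x2) ** 2) * dx * dx)

def energy(m, n):
    """E_mn in units of pi^2 hbar^2 / (2 m a^2)."""
    return m**2 + n**2

print(norm(1, 1), norm(1, 2), energy(1, 1))
```

Both norms should come out very close to 1, which illustrates why the $1/\sqrt{2}$ factor is needed for $m \neq n$ but must be dropped for $m = n$.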


What happens when the particles have spin? As a specific example, consider
the case of two spin-1/2 particles. Recall from Chapter 8 that the spin states can
be expressed in terms of the total spin quantum number for the two particles $s$ and
the $z$ component of this total spin $m_s$. These states, in turn, are related to the “spin
up” and “spin down” states of the individual particles through the relations:


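$$|1\ 1\rangle = |\uparrow\uparrow\rangle$$

$$|1\ 0\rangle = \frac{1}{\sqrt{2}}|\uparrow\downarrow\rangle + \frac{1}{\sqrt{2}}|\downarrow\uparrow\rangle$$

$$|1\ {-1}\rangle = |\downarrow\downarrow\rangle$$

$$|0\ 0\rangle = \frac{1}{\sqrt{2}}|\uparrow\downarrow\rangle - \frac{1}{\sqrt{2}}|\downarrow\uparrow\rangle$$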


where the states on the left-hand side of these equations are the $|s\ m_s\rangle$ states, and
the states on the right-hand side are the spin up or spin down states of the two
individual particles.




FIGURE 13.2 The exchange operator interchanges both the positions and spins of the 
two particles. 


Now note an important point: All three of the triplet states (i.e., the states with
$s = 1$) are symmetric under interchange of the two particles, while the singlet
state ($s = 0$) is antisymmetric under interchange of the two particles. When the
exchange operator is applied to two particles, it interchanges both their spatial
positions and their spins (Figure 13.2). Hence, in considering whether a wave
function is symmetric or antisymmetric under particle exchange, the full wave
function must include both the spatial wave function and the spin wave function.
For two spin-1/2 particles, for example, we write the full wave function $|1\ 2\rangle$
as the product of the spatial part of the wave function and spin part of the wave
function:

$$|1\ 2\rangle = \psi(\mathbf{r}_1, \mathbf{r}_2)\,|s\ m_s\rangle$$

and we require that this total wave function be symmetric for bosons and
antisymmetric for fermions. Since the spatial part of the wave function can be either
symmetric or antisymmetric, and the spin part of the wave function can be either
symmetric or antisymmetric, there are only four possibilities:

$\psi(\mathbf{r}_1, \mathbf{r}_2)$ is symmetric, $|s\ m_s\rangle$ is symmetric $\rightarrow$ total wave function is symmetric

$\psi(\mathbf{r}_1, \mathbf{r}_2)$ is antisymmetric, $|s\ m_s\rangle$ is symmetric $\rightarrow$ total wave function is
antisymmetric

$\psi(\mathbf{r}_1, \mathbf{r}_2)$ is symmetric, $|s\ m_s\rangle$ is antisymmetric $\rightarrow$ total wave function is
antisymmetric

$\psi(\mathbf{r}_1, \mathbf{r}_2)$ is antisymmetric, $|s\ m_s\rangle$ is antisymmetric $\rightarrow$ total wave function is
symmetric.






Thus, for bosons the spatial part of the wave function and the spin part of the
wave function must either be both symmetric or both antisymmetric. For fermions,
a symmetric spin state implies an antisymmetric spatial wave function, and an
antisymmetric spin state implies a symmetric spatial wave function. Thus, two
spin-1/2 particles in the triplet state will have an antisymmetric spatial wave function,
while two spin-1/2 particles in the singlet state will have a symmetric spatial wave
function. It is impossible to determine the spatial wave function and spin state in
isolation; the knowledge of one is needed to determine the possible choices for the
other.
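
The exchange symmetry of the four spin states can be verified with a small matrix calculation. This is only a sketch: the basis ordering and the names `SWAP`, `exchange_sign`, and `total_symmetry` are assumptions introduced here for illustration, not notation from the text.

```python
import numpy as np

# Basis ordering (an assumed convention): |up up>, |up down>, |down up>, |down down>
UU, UD, DU, DD = np.eye(4)

# Exchanging the two spins swaps |up down> and |down up>, leaving the others alone.
SWAP = np.array([
    [1, 0, 0, 0],
    [0, 0, 1, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 1],
], dtype=float)

triplet = {
    "|1 1>": UU,
    "|1 0>": (UD + DU) / np.sqrt(2),
    "|1 -1>": DD,
}
singlet = (UD - DU) / np.sqrt(2)

def exchange_sign(state):
    """Return +1 for a symmetric state, -1 for an antisymmetric one, else None."""
    swapped = SWAP @ state
    if np.allclose(swapped, state):
        return 1
    if np.allclose(swapped, -state):
        return -1
    return None

def total_symmetry(spatial_sign, spin_sign):
    """Exchange symmetry of the product of spatial and spin parts."""
    return spatial_sign * spin_sign

for label, state in triplet.items():
    print(label, exchange_sign(state))
print("|0 0>", exchange_sign(singlet))
```

A state such as $|\uparrow\downarrow\rangle$ on its own is neither symmetric nor antisymmetric, which is why `exchange_sign` returns `None` for it.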


Example 13.3. Two Identical Spin-1/2 Particles in an Infinite One- 
Dimensional Square Well. 

Two identical spin-1/2 particles are confined in an infinite one-dimensional square 
well with width a. Find the spatial wave function and energy for the lowest-energy 
singlet state and lowest-energy triplet state, respectively. 

The singlet state has an antisymmetric spin state and, therefore, a symmetric 
spatial wave function. Thus, the lowest energy state is the same as in Example 
13.2: the state m = n = 1: 




$$\psi_{11}(x_1, x_2) = \frac{2}{a}\,\sin\left(\frac{\pi x_1}{a}\right)\sin\left(\frac{\pi x_2}{a}\right)$$


with energy 


$$E_{11} = \frac{\pi^2\hbar^2}{ma^2}$$


For the triplet state, the spin state is symmetric, so the spatial wave function
must be antisymmetric. For this case, $m = n$ is not allowed, and the lowest energy
state corresponds to the antisymmetric wave function with $m = 1$, $n = 2$:


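$$\psi_{12}(x_1, x_2) = \frac{1}{\sqrt{2}}\,\frac{2}{a}\left[\sin\left(\frac{\pi x_1}{a}\right)\sin\left(\frac{2\pi x_2}{a}\right) - \sin\left(\frac{2\pi x_1}{a}\right)\sin\left(\frac{\pi x_2}{a}\right)\right]$$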


with energy 


$$E_{12} = \frac{\pi^2\hbar^2}{2ma^2}\left[1^2 + 2^2\right] = \frac{5\pi^2\hbar^2}{2ma^2}$$
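
The same reasoning extends to higher levels: a singlet allows any pair $(m, n)$, while a triplet excludes $m = n$. The sketch below enumerates the lowest levels under that rule; the function name `lowest_levels` and the cutoff `n_max` are hypothetical choices made for this illustration.

```python
def lowest_levels(spin_state, count=3, n_max=6):
    """Return the `count` lowest (energy, m, n) tuples with m <= n.

    Energies are in units of pi^2 hbar^2 / (2 m a^2).
    spin_state: "singlet" (symmetric spatial part, m = n allowed) or
                "triplet" (antisymmetric spatial part, m = n forbidden).
    """
    if spin_state not in ("singlet", "triplet"):
        raise ValueError("spin_state must be 'singlet' or 'triplet'")
    levels = []
    for m in range(1, n_max + 1):
        for n in range(m, n_max + 1):
            if spin_state == "triplet" and m == n:
                continue  # antisymmetric spatial wave function vanishes
            levels.append((m * m + n * n, m, n))
    return sorted(levels)[:count]

print(lowest_levels("singlet", count=1))
print(lowest_levels("triplet", count=1))
```

With `count=1` this reproduces the two states found in Example 13.3: energy 2 (in the stated units) for the singlet and 5 for the triplet.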


The requirement that the fermion states must be antisymmetric has a profound 
effect on the behavior of electrons in atoms. This will be explored in more detail 
in the next section; here we examine the simplest multielectron atom: the helium 
atom. 





Example 13.4. The Ground State of the Helium Atom. 

The full potential for the two electrons in a helium atom is given by Equation 
(13.3): 


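$$V(\mathbf{r}_1, \mathbf{r}_2) = -\frac{2e^2}{4\pi\epsilon_0}\frac{1}{r_1} - \frac{2e^2}{4\pi\epsilon_0}\frac{1}{r_2} + \frac{e^2}{4\pi\epsilon_0}\frac{1}{|\mathbf{r}_1 - \mathbf{r}_2|} \tag{13.13}$$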


As a first approximation, we neglect the final term, which represents the mutual
repulsion between the two electrons. In this limit the electrons are treated as
independent particles, each of which feels the central Coulomb potential produced by the
nucleus with charge $+2e$.

Writing the spatial part of the one-particle wave function in the standard form
$\psi_{nlm_l}$, the lowest-energy wave function for a single electron in the potential
$V = -2e^2/4\pi\epsilon_0 r$ is

$$\psi_{100}^{\rm He} = \frac{1}{\sqrt{\pi}}\left(\frac{2}{a_0}\right)^{3/2} e^{-2r/a_0}$$

with energy

$$E_1^{\rm He} = 4E_1^{\rm H}$$

where the “He” superscript refers to the wave function and energy of a single
electron in the Coulomb potential of the helium nucleus, and the “H” superscript
refers to the corresponding quantities in hydrogen. Since $E_1^{\rm H}$ is the ground-state
energy of hydrogen, we have $E_1^{\rm H} = -13.6$ eV, so $E_1^{\rm He} = -54.4$ eV.

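For the helium atom to be in the lowest possible energy state, both electron
wave functions must correspond to $\psi_{100}^{\rm He}$. So the spatial part of the wave function
is symmetric:

$$\psi(\mathbf{r}_1, \mathbf{r}_2) = \psi_{100}^{\rm He}(\mathbf{r}_1)\,\psi_{100}^{\rm He}(\mathbf{r}_2) = \frac{8}{\pi a_0^3}\, e^{-2r_1/a_0} e^{-2r_2/a_0} \tag{13.14}$$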

Since the spatial part of the wave function is symmetric, and the electrons are 
fermions, which must have an antisymmetric total wave function, the spin part of 
the wave function must be antisymmetric. Thus, the two electrons in the helium 
atom must be in the singlet state: 

$$|0\ 0\rangle = \frac{1}{\sqrt{2}}|\uparrow\downarrow\rangle - \frac{1}{\sqrt{2}}|\downarrow\uparrow\rangle \tag{13.15}$$

In elementary explanations of atomic structure, it is often stated that the atomic
electrons fill the lowest possible energy states (in this case, the two states with
$n = 1$, $l = 0$, $m_l = 0$) but then, to fulfill the Pauli exclusion principle, the electrons
must have opposite spins. However, Equation (13.15) shows that this explanation
is an oversimplification: the two electrons are not forced into a state, for instance,
where electron 1 has spin up and electron 2 has spin down. Rather, they are forced
into the singlet spin state with total spin $s = 0$ and $m_s = 0$; the spins of the electrons
are a mixture of $|\uparrow\downarrow\rangle$ and $|\downarrow\uparrow\rangle$.

The total energy corresponding to the spatial wave function in Equation (13.14)
is $E = 2E_1^{\rm He} = -108.8$ eV, while the true ground-state energy of helium is
$-79.0$ eV. Thus, our estimate of the ground-state energy is rather far off the mark.
However, this estimate can be improved by treating the electron-electron repulsion
(the last term in Equation 13.13) as a perturbation and applying first-order
perturbation theory.

The change in the energy due to this repulsion is then 


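$$E^{(1)} = \int d^3r_1\, d^3r_2 \left(\frac{8}{\pi a_0^3}\, e^{-2r_1/a_0} e^{-2r_2/a_0}\right) \left(\frac{e^2}{4\pi\epsilon_0}\frac{1}{|\mathbf{r}_1 - \mathbf{r}_2|}\right) \left(\frac{8}{\pi a_0^3}\, e^{-2r_1/a_0} e^{-2r_2/a_0}\right)$$

$$= \frac{5}{4}\,\frac{e^2}{4\pi\epsilon_0 a_0} = 34.0\ \text{eV}$$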


As expected, this change in the energy is positive since it represents a repulsion
between the two electrons. Adding it to the energy of the unperturbed two-particle
wave function, we get an estimate of the total ground-state energy:
$E = -108.8\ \text{eV} + 34.0\ \text{eV} = -74.8\ \text{eV}$. This differs from the true ground-state
energy by about 5%, which is not bad agreement (although the variational principle,
Chapter 10, does provide a better estimate for this energy).
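
The arithmetic in this example can be reproduced in a few lines. This is a sketch only; it uses the values quoted in the text together with the standard relation $e^2/4\pi\epsilon_0 a_0 = -2E_1^{\rm H}$, and all variable names are illustrative.

```python
E_H = -13.6      # hydrogen ground-state energy in eV (value quoted in the text)
Z = 2            # nuclear charge of helium
E_TRUE = -79.0   # true helium ground-state energy in eV (value quoted in the text)

def single_electron_energy(z, e_h=E_H):
    """Ground-state energy of one electron bound to a charge +z e: z^2 times hydrogen's."""
    return z**2 * e_h

def repulsion_first_order(e_h=E_H):
    """First-order shift (5/4) e^2/(4 pi eps0 a0), using e^2/(4 pi eps0 a0) = -2 E_H."""
    return 1.25 * (-2.0 * e_h)

E0 = 2 * single_electron_energy(Z)   # two independent electrons
E1 = repulsion_first_order()         # electron-electron repulsion
E_est = E0 + E1
rel_error = abs(E_est - E_TRUE) / abs(E_TRUE)

print(f"unperturbed: {E0:.1f} eV, correction: {E1:.1f} eV, estimate: {E_est:.1f} eV")
print(f"relative error: {100 * rel_error:.1f}%")
```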


These arguments that we have derived for the two-particle wave function can 
be extended to systems of more particles. For instance, the total wave function 
for three fermions must be antisymmetric under interchange of any two of the 
particles. If the spin part is symmetric, then this implies, for example, 

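$$\psi(\mathbf{r}_1, \mathbf{r}_2, \mathbf{r}_3) = -\psi(\mathbf{r}_2, \mathbf{r}_1, \mathbf{r}_3) = -\psi(\mathbf{r}_1, \mathbf{r}_3, \mathbf{r}_2)$$

and so on. If $\psi_1(\mathbf{r})$, $\psi_2(\mathbf{r})$, $\psi_3(\mathbf{r})$ are three different solutions to the one-particle
Schrodinger equation, then the fully antisymmetric spatial wave function is given
by

$$\psi(\mathbf{r}_1, \mathbf{r}_2, \mathbf{r}_3) = \frac{1}{\sqrt{6}}\big[\psi_1(\mathbf{r}_1)\psi_2(\mathbf{r}_2)\psi_3(\mathbf{r}_3) - \psi_1(\mathbf{r}_2)\psi_2(\mathbf{r}_1)\psi_3(\mathbf{r}_3)$$
$$\qquad - \psi_1(\mathbf{r}_1)\psi_2(\mathbf{r}_3)\psi_3(\mathbf{r}_2) - \psi_1(\mathbf{r}_3)\psi_2(\mathbf{r}_2)\psi_3(\mathbf{r}_1)$$
$$\qquad + \psi_1(\mathbf{r}_2)\psi_2(\mathbf{r}_3)\psi_3(\mathbf{r}_1) + \psi_1(\mathbf{r}_3)\psi_2(\mathbf{r}_1)\psi_3(\mathbf{r}_2)\big]$$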


This can be written in a particularly compact form as a determinant: 


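$$\psi(\mathbf{r}_1, \mathbf{r}_2, \mathbf{r}_3) = \frac{1}{\sqrt{6}} \begin{vmatrix} \psi_1(\mathbf{r}_1) & \psi_1(\mathbf{r}_2) & \psi_1(\mathbf{r}_3) \\ \psi_2(\mathbf{r}_1) & \psi_2(\mathbf{r}_2) & \psi_2(\mathbf{r}_3) \\ \psi_3(\mathbf{r}_1) & \psi_3(\mathbf{r}_2) & \psi_3(\mathbf{r}_3) \end{vmatrix}$$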





which is called a Slater determinant. Of course, the usefulness of this approach
diminishes as the number of particles increases. For the uranium atom, for instance,
the Slater determinant for the electrons is a $92 \times 92$ determinant, which expands
out into $92! \approx 10^{142}$ terms. (This expansion written out would far exceed the length
of the entire visible universe.) Clearly, other approaches are called for.
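
A Slater determinant is straightforward to evaluate numerically for a handful of particles. The sketch below uses one-dimensional infinite-well orbitals purely as convenient test functions (an assumption made for illustration); the helper names `slater` and `well_orbital` are hypothetical.

```python
import math
import numpy as np

def slater(orbitals, positions):
    """Antisymmetric wave function for N particles in N distinct orbitals.

    orbitals:  list of N callables psi_i(x)
    positions: sequence of N particle coordinates
    Rows of the matrix are orbitals and columns are particles.
    """
    n = len(orbitals)
    matrix = np.array([[orb(x) for x in positions] for orb in orbitals])
    return np.linalg.det(matrix) / math.sqrt(math.factorial(n))

def well_orbital(n, a=1.0):
    """Infinite-square-well eigenfunction, used here only as a test orbital."""
    return lambda x: math.sqrt(2.0 / a) * math.sin(n * math.pi * x / a)

orbitals = [well_orbital(n) for n in (1, 2, 3)]

psi_123 = slater(orbitals, [0.2, 0.5, 0.7])
psi_213 = slater(orbitals, [0.5, 0.2, 0.7])    # particles 1 and 2 interchanged
psi_same = slater(orbitals, [0.2, 0.2, 0.7])   # two particles at the same point

terms_uranium = math.factorial(92)             # terms in the 92 x 92 expansion

print(psi_123, psi_213, psi_same)
print(len(str(terms_uranium)), "digits in 92!")
```

Interchanging two particles swaps two columns and flips the sign, and placing two particles at the same point makes two columns equal so that the determinant vanishes.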


13.2 ■ MULTIELECTRON ATOMS 

The electron in hydrogen is characterized by four quantum numbers: $n$, $l$, $m_l$, and
$m_s$. For electrons in multielectron atoms, the electron-electron interactions alter the
wave functions and energy levels, and it is not possible to solve the Schrodinger
equation analytically. On the other hand, the electrons in a multielectron atom
can still be characterized by the same set of quantum numbers, and the Pauli
exclusion principle prevents any two electrons from sharing the same complete set
of quantum numbers.

Recall that for the hydrogen atom, the energy of a given state is entirely
determined by the principal quantum number $n$; all $l$, $m_l$, and $m_s$ states for a given $n$
have the same energy (aside from the small perturbations which were discussed
in Chapter 9). In multielectron atoms, the electron-electron interactions break this
degeneracy so that the energy of a given electron depends on both $n$ and $l$.
However, the various $m_l$ and $m_s$ states for a given $n$ and $l$ are still degenerate in energy
(since they represent quantities which depend on the orientation of the coordinate
system). For this reason, the set of all possible $m_l$ and $m_s$ states for a fixed $n$ and
$l$ is called a subshell; each subshell corresponds to a distinct energy level. Since
$m_l$ ranges from $-l$ to $+l$, and there are two spin states for each value of $n$, $l$, and
$m_l$, a given subshell can hold $2(2l + 1)$ electrons. For a given value of $n$, the set of
all possible $l$, $m_l$, and $m_s$ states is called a shell; each shell can hold $2n^2$ electrons
(Exercise 13.7).
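
These counting rules can be checked by direct enumeration. The following is a minimal sketch with illustrative function names; it counts states rather than proving the $2n^2$ result.

```python
def subshell_capacity(l):
    """Count the (m_l, m_s) combinations for a given l: two spin states per m_l."""
    return sum(2 for m_l in range(-l, l + 1))

def shell_capacity(n):
    """Sum the subshell capacities for l = 0, 1, ..., n - 1."""
    return sum(subshell_capacity(l) for l in range(n))

for n in range(1, 5):
    print(n, [subshell_capacity(l) for l in range(n)], shell_capacity(n))
```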

The notation which indicates the number of electrons occupying each subshell 
in a multielectron atom is rather arcane. Unfortunately, it is also rather standard, 
so we shall use it here. A given n, l subshell is denoted by a number (which gives 
n) and a letter (which gives l, in accordance with the previously-used notation: 
$l = 0$ is denoted by $s$, $l = 1$ is denoted by $p$, $l = 2$ is denoted by $d$, and so on). So
the first few subshells in an atom are:

$1s \rightarrow n = 1,\ l = 0$
$2s \rightarrow n = 2,\ l = 0$
$2p \rightarrow n = 2,\ l = 1$
$3s \rightarrow n = 3,\ l = 0$
$3p \rightarrow n = 3,\ l = 1$
$3d \rightarrow n = 3,\ l = 2$


The number of electrons occupying each subshell is indicated by a superscript. For
instance, the ground state of boron is written as

$$1s^2\, 2s^2\, 2p^1 \tag{13.16}$$

indicating that two electrons have $n = 1$ and $l = 0$, two electrons have $n = 2$ and
$l = 0$, and one electron has $n = 2$ and $l = 1$.

Just as in hydrogen, the higher $n$ states have higher energy, but what about the
$l$ states? In general, states with higher $l$ have wave functions which are peaked
further from the nucleus (see, for example, Figure 6.7). Thus, the lower $l$ states
“feel” a stronger Coulomb attraction and have lower energy. For example, in the
ground state of the lithium atom, which has three electrons, the first two electrons
fall into the $1s$ subshell, and the third electron can have either $n = 2$, $l = 0$ or
$n = 2$, $l = 1$. Our argument indicates that the $n = 2$, $l = 0$ state has the lower
energy, so the ground state of lithium is

$$1s^2\, 2s^1$$

In general then, subshells are filled first from lower $n$ to higher $n$, and for a given
value of $n$, from lower $l$ to higher $l$; this is called the Aufbau principle. Thus,
we expect the subshells to be filled in the order: $1s, 2s, 2p, 3s, 3p, 3d, 4s, \ldots$
This simple rule works well for the lightest atoms, but it breaks down at the $3d$
state. At this point screening by the inner electrons becomes so important that the
$n = 4$, $l = 0$ state is pulled below the $n = 3$, $l = 2$ state, so that the order in which
the subshells are actually filled is: $1s, 2s, 2p, 3s, 3p, 4s, 3d, \ldots$ For heavier
elements, the order in which the subshells are filled becomes even more
complicated.
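
The filling procedure described above can be sketched as a short routine. It assumes the filling order quoted in the text and nothing more, so it is reliable only for the lighter atoms; real atoms show exceptions (chromium and copper, for example) that this sketch does not capture. The names `FILLING_ORDER` and `configuration` are illustrative.

```python
# Filling order quoted in the text; adequate for the lighter atoms only.
FILLING_ORDER = ["1s", "2s", "2p", "3s", "3p", "4s", "3d"]
L_VALUES = {"s": 0, "p": 1, "d": 2}

def configuration(z):
    """Ground-state configuration for z electrons using FILLING_ORDER (no exceptions handled)."""
    remaining = z
    parts = []
    for subshell in FILLING_ORDER:
        if remaining == 0:
            break
        capacity = 2 * (2 * L_VALUES[subshell[1]] + 1)  # 2(2l + 1)
        occupancy = min(capacity, remaining)
        parts.append(f"{subshell}^{occupancy}")
        remaining -= occupancy
    if remaining:
        raise ValueError("z exceeds the subshells listed in FILLING_ORDER")
    return " ".join(parts)

print(configuration(3))   # lithium
print(configuration(5))   # boron
```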

This filling of subshells and shells is the basis for all of chemistry. A shell 
with a full set of electrons, called a closed shell, plays no role in binding to other 
atoms; it is the electrons in a partially filled shell which determine the chemical 
behavior of an atom. (Note that it is a filled shell, not a filled subshell, which renders 
the electrons inactive as far as chemical activity is concerned.) Atoms consisting 
entirely of closed shells are chemically inert. Because these atoms cannot bind 
even to each other, the corresponding elements are all gases at room temperature:
the “noble gases.” The first few such atoms with their electron configurations are:

helium    $1s^2$

neon      $1s^2\, 2s^2\, 2p^6$

argon     $1s^2\, 2s^2\, 2p^6\, 3s^2\, 3p^6$

Note that the n = 3 shell for argon is considered closed despite the fact that there 
are no electrons in the 3d subshell. As noted earlier this is the point at which our 
simple rules for filling subshells break down, with the 3d states having higher 
energy than the 4s states. 





The alkali metals (lithium, sodium, potassium, etc.) have a single electron in an 
unfilled shell, making them very reactive as electron donors: 

lithium      $1s^2\, 2s$
sodium       $1s^2\, 2s^2\, 2p^6\, 3s$
potassium    $1s^2\, 2s^2\, 2p^6\, 3s^2\, 3p^6\, 4s$

Similarly, the halogens (fluorine, chlorine, bromine,...) are one electron short of 
a filled shell, rendering them prone to grab electrons from other atoms. This shell 
filling, of course, is what produces the periodic table. 

A given atom can also be characterized by its angular momentum states. These
states are labeled by the quantum numbers $L$, $S$, and $J$, which correspond to the
total orbital angular momentum of the electrons in the atom ($L$), the total spin
angular momentum of the atomic electrons ($S$), and the total
angular momentum ($J$). (We use uppercase letters to denote total angular momenta,
and lowercase letters to denote angular momenta of individual electrons. Note
that the angular momentum of the nucleus is ignored here.) The way in which the
individual spins and orbital angular momenta of the electrons in an atom couple to
give $L$, $S$, and $J$ depends on the number of electrons in the atom. For the lightest
elements, it is a good approximation to assume that all of the individual orbital
angular momenta couple to give the value for $L$, and all of the individual spin
angular momenta couple to give the value for $S$. These total values for $L$ and $S$
then couple, in accordance with the relation

$$|L - S| \le J \le L + S$$

to give the total angular momentum $J$. This coupling scheme is called L-S coupling
or Russell-Saunders coupling, and it is the only one we will consider in detail here.
For some heavy elements, it is a better approximation to assume that the individual
$l$ and $s$ of each electron couple to give a distinct total angular momentum $j$ for
each electron; all of these $j$'s then couple to give the total $J$. This is called j-j
coupling and will not be considered further. While a given atom can have a variety
of values for $L$, $S$, and $J$, the ground state of a given atom always has a unique
set of these quantum numbers. (Note that the values of $L$, $S$, and $J$ for the ground
state of an atom have no particular importance for its chemical behavior.)

Consider first the simplest case, hydrogen. The ground state has a single electron
with $l = 0$ and $s = 1/2$, which means that the total orbital angular momentum and
total spin angular momentum must also be $L = 0$ and $S = 1/2$. These can couple
to give only a single value for $J$, namely, $J = 1/2$. This is written in the rather
unusual (but standard!) notation introduced in Chapter 9; the value for $L$ is written
as a letter (in this case $S$), and the value of $J$ is written as a subscript to give $S_{1/2}$.
Now we must add a superscript to denote the value of $S$, but instead of writing the
value of $S$, we write the value of $2S + 1$ as an upper left-hand superscript (don’t
blame me; I didn’t invent this). Thus, the full set of values for $L$, $S$, and $J$ are given
as

$$^{2S+1}L_J$$

In this notation, the ground state of hydrogen is written as

$$^2S_{1/2}$$

(Note that capital letters $S, P, D, \ldots$ denote the total orbital angular momentum of
an atom, while small letters $s, p, d, \ldots$ such as we used in describing the electron
configuration, refer to the angular momentum states of individual electrons.)

Moving on to helium, recall from the previous section that the electrons in the
ground state must be in the singlet state, so that $S = 0$. Further, both electrons
occupy $l = 0$ states, so they can only couple to give $L = 0$. Finally, $S = 0$ and
$L = 0$ can only give $J = 0$, so we get

$$^1S_0$$

as the ground state of helium. This result can be generalized: the electrons in a
closed subshell always pair off to give $L = 0$, $S = 0$, and $J = 0$; it is the electrons
in partially-filled subshells which then determine the total angular momentum
state. (Note the difference from chemistry, where electrons become chemically
irrelevant only if the full shell is filled, not a subshell.)

Hydrogen and helium are relatively straightforward examples in the sense that
the angular momenta can couple to give only a single unique set of quantum
numbers. Now consider an example where this is not the case. Boron has 5 electrons,
in the configuration shown in Equation (13.16). The closed subshells contribute
nothing to the angular momentum, which is determined entirely by the single
electron with $l = 1$ and spin 1/2. Thus, we must have $L = 1$ and $S = 1/2$. However,
this leads to two possible values for $J$: $J = 1/2$ or $J = 3/2$. Which is the lowest
energy state?

To determine the ground state for boron (and other atoms), a set of empirical
rules (called Hund’s rules) provides a guide to choosing the values of $S$, $L$, and $J$
that give the lowest energy state. These rules are applied in the following order:

Hund’s Rule #1: Given more than one allowed value for $S$, choose the largest
possible value.

Hund’s Rule #2: Given more than one allowed value for $L$, choose the largest
possible value.

Hund’s Rule #3: Given more than one allowed value for $J$, choose the smallest
possible value for $J$ if the subshell under consideration is less than half full, and
choose the largest possible value for $J$ if the subshell is more than half full.

Applying these rules to the case of boron, we see that Rule #1 and Rule #2 are
irrelevant; we have only a single possible value for $S$ and for $L$. It is Rule #3 which
determines the ground state. The $2p$ subshell can hold six electrons, so with only a
single electron, it is clearly less than half full. Therefore, Rule #3 tells us to choose
the smallest possible value for $J$, which in this case is $J = 1/2$. This gives the
ground state of boron:

$$^2P_{1/2}$$
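
The coupling relation $|L - S| \le J \le L + S$ and Hund's Rule #3 lend themselves to a short calculation. The sketch below is illustrative only; the function names and the plain-text term-symbol format (for example `2P1/2`) are assumptions introduced here.

```python
from fractions import Fraction

L_LETTERS = "SPDFGH"  # spectroscopic letters for L = 0, 1, 2, ...

def allowed_j(L, S):
    """All J with |L - S| <= J <= L + S, in integer steps."""
    L, S = Fraction(L), Fraction(S)
    j = abs(L - S)
    values = []
    while j <= L + S:
        values.append(j)
        j += 1
    return values

def ground_j(L, S, electrons, l):
    """Apply Hund's Rule #3 to a subshell with orbital quantum number l."""
    half = 2 * l + 1  # half of the subshell capacity 2(2l + 1)
    js = allowed_j(L, S)
    if len(js) == 1:
        return js[0]
    if electrons == half:
        raise ValueError("Rule #3 as stated does not cover an exactly half-full subshell")
    return min(js) if electrons < half else max(js)

def term_symbol(L, S, J):
    """Format the state as (2S+1) L_J in plain text, e.g. '2P1/2'."""
    multiplicity = 2 * Fraction(S) + 1
    return f"{multiplicity}{L_LETTERS[int(L)]}{Fraction(J)}"

# Boron: one 2p electron, so L = 1 and S = 1/2.
J = ground_j(1, Fraction(1, 2), electrons=1, l=1)
print(term_symbol(1, Fraction(1, 2), J))
```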

When there is more than one electron in an unfilled shell, things become much 
more complex as illustrated in the next example. 


Example 13.5. The Ground State of Carbon. 

The electron configuration for carbon is

$$1s^2\, 2s^2\, 2p^2$$

The unfilled subshell contains two electrons, each with spin 1/2 and $l = 1$. Thus,
they can couple to give

$$L = 0, 1, 2$$

and

$$S = 0, 1$$

Together, these yield six possible pairs for $L$ and $S$:

$$L = 0,\ S = 0 \tag{13.17}$$

$$L = 1,\ S = 0 \tag{13.18}$$

$$L = 2,\ S = 0 \tag{13.19}$$

$$L = 0,\ S = 1 \tag{13.20}$$

$$L = 1,\ S = 1 \tag{13.21}$$

$$L = 2,\ S = 1 \tag{13.22}$$


However, not all of these possible states are allowed; some of them can only be
obtained by putting electrons into the same $m_l$ and $m_s$ states, violating the exclusion
principle. This is a subtle point which requires further exploration. Each electron
can have $m_l = -1$, 0, or 1, and $m_s = +1/2$ or $-1/2$. We write the $m_l$, $m_s$ state for
the two electrons in Dirac notation as $|m_l(1)\ m_s(1)\ m_l(2)\ m_s(2)\rangle$, where (1) and
(2) denote the first and second electron, respectively. To avoid confusion, we use
the numerical value for $m_l$ and arrow notation for $m_s$. In this notation, for instance,
a state for which the first electron has $m_l = 1$, $m_s = 1/2$ and the second electron
has $m_l = 0$, $m_s = -1/2$ would be written as

$$|1\uparrow\ 0\downarrow\rangle$$

We can now catalog all possible states for the two electrons in terms of their
$m_l$ and $m_s$ values. In doing so we recall that states for which the two electrons
have the same values for $m_l$ and $m_s$ are not allowed by the exclusion principle. For
example, the state $|0\uparrow\ 0\uparrow\rangle$ is excluded. Further, we don’t want to double-count
states which are obtained by interchanging the two electrons; e.g., the states
$|1\uparrow\ 0\downarrow\rangle$ and $|0\downarrow\ 1\uparrow\rangle$ are identical, so we will list only one of them. With
these cautionary notes, we proceed to list all of the allowed states for $m_l$ and $m_s$
for the two electrons:


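$|1\uparrow\ 1\downarrow\rangle$
$|1\uparrow\ 0\uparrow\rangle$
$|1\uparrow\ 0\downarrow\rangle$
$|1\uparrow\ {-1}\uparrow\rangle$
$|1\uparrow\ {-1}\downarrow\rangle$
$|1\downarrow\ 0\uparrow\rangle$
$|1\downarrow\ 0\downarrow\rangle$
$|1\downarrow\ {-1}\uparrow\rangle$
$|1\downarrow\ {-1}\downarrow\rangle$
$|0\uparrow\ 0\downarrow\rangle$
$|0\uparrow\ {-1}\uparrow\rangle$
$|0\uparrow\ {-1}\downarrow\rangle$
$|0\downarrow\ {-1}\uparrow\rangle$
$|0\downarrow\ {-1}\downarrow\rangle$
$|{-1}\uparrow\ {-1}\downarrow\rangle$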


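For each of these states, we then determine the total values $M_L$ and $M_S$, which are
simply the sums $M_L = m_l(1) + m_l(2)$ and $M_S = m_s(1) + m_s(2)$:

$$|1\uparrow\ 1\downarrow\rangle \rightarrow M_L = 2,\ M_S = 0 \tag{13.23}$$

$$|1\uparrow\ 0\uparrow\rangle \rightarrow M_L = 1,\ M_S = 1 \tag{13.24}$$

$$|1\uparrow\ 0\downarrow\rangle \rightarrow M_L = 1,\ M_S = 0 \tag{13.25}$$

$$|1\uparrow\ {-1}\uparrow\rangle \rightarrow M_L = 0,\ M_S = 1 \tag{13.26}$$

$$|1\uparrow\ {-1}\downarrow\rangle \rightarrow M_L = 0,\ M_S = 0 \tag{13.27}$$

$$|1\downarrow\ 0\uparrow\rangle \rightarrow M_L = 1,\ M_S = 0 \tag{13.28}$$

$$|1\downarrow\ 0\downarrow\rangle \rightarrow M_L = 1,\ M_S = -1 \tag{13.29}$$

$$|1\downarrow\ {-1}\uparrow\rangle \rightarrow M_L = 0,\ M_S = 0 \tag{13.30}$$

$$|1\downarrow\ {-1}\downarrow\rangle \rightarrow M_L = 0,\ M_S = -1 \tag{13.31}$$

$$|0\uparrow\ 0\downarrow\rangle \rightarrow M_L = 0,\ M_S = 0 \tag{13.32}$$

$$|0\uparrow\ {-1}\uparrow\rangle \rightarrow M_L = -1,\ M_S = 1 \tag{13.33}$$

$$|0\uparrow\ {-1}\downarrow\rangle \rightarrow M_L = -1,\ M_S = 0 \tag{13.34}$$

$$|0\downarrow\ {-1}\uparrow\rangle \rightarrow M_L = -1,\ M_S = 0 \tag{13.35}$$

$$|0\downarrow\ {-1}\downarrow\rangle \rightarrow M_L = -1,\ M_S = -1 \tag{13.36}$$

$$|{-1}\uparrow\ {-1}\downarrow\rangle \rightarrow M_L = -2,\ M_S = 0 \tag{13.37}$$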


Now note an important point: some of the $L$, $S$ states listed in Equations (13.17)-
(13.22) will lead to values of $M_L$ and $M_S$ which do not appear in Equations (13.23)-
(13.37). These values of $L$ and $S$ must be excluded; the physical reason they cannot
exist is that they violate the exclusion principle. For instance, one of our six $L$, $S$
pairs is $L = 2$, $S = 1$. If this were a possible state for the electrons, then it would
lead to all possible pairs of the states $M_L = -2, -1, 0, 1, 2$ and $M_S = -1, 0, 1$
in Equations (13.23)-(13.37). While some of these states are observed, others are
not. For instance, we do not see $M_L = 2$, $M_S = 1$. The reason for this is that this
state can arise only if both electrons have $m_l = 1$ and $m_s = 1/2$; however, the
exclusion principle prevents both electrons from being in this same state. Thus,
the state $L = 2$, $S = 1$ is ruled out. The task is then to find a subset of the $L$, $S$
states from the six given in Equations (13.17)-(13.22) which produces all of the
$M_L$ and $M_S$ states in Equations (13.23)-(13.37), but which does not produce any
“extra” $M_L$, $M_S$ states not on this list. It’s a bit like solving a puzzle.

Having ruled out the state $L = 2$, $S = 1$, we can now rule in several other states.
The state $L = 1$, $S = 1$ must be allowed, since this is the only other $L$, $S$ state
which can give us $M_L = 1$, $M_S = 1$, which is seen in Equation (13.24). This $L$, $S$
state then produces all of the following values of $M_L$ and $M_S$:

$$M_L = -1,\ M_S = -1 \tag{13.38}$$

$$M_L = -1,\ M_S = 0 \tag{13.39}$$

$$M_L = -1,\ M_S = 1 \tag{13.40}$$

$$M_L = 0,\ M_S = -1 \tag{13.41}$$

$$M_L = 0,\ M_S = 0 \tag{13.42}$$

$$M_L = 0,\ M_S = 1 \tag{13.43}$$

$$M_L = 1,\ M_S = -1 \tag{13.44}$$

$$M_L = 1,\ M_S = 0 \tag{13.45}$$

$$M_L = 1,\ M_S = 1 \tag{13.46}$$


Similarly, the state $L = 2$, $S = 0$ is the only remaining state which can give
$M_L = 2$, $M_S = 0$, seen in Equation (13.23). This state produces the following five
$M_L$, $M_S$ states:

$$M_L = -2,\ M_S = 0 \tag{13.47}$$

$$M_L = -1,\ M_S = 0 \tag{13.48}$$

$$M_L = 0,\ M_S = 0 \tag{13.49}$$

$$M_L = 1,\ M_S = 0 \tag{13.50}$$

$$M_L = 2,\ M_S = 0 \tag{13.51}$$


When we remove the 14 states given by Equations (13.38)-(13.51) from the list
of states given by Equations (13.23)-(13.37), we are left with only a single state
to account for: $M_L = 0$, $M_S = 0$. We can produce this state (and only this state)
by taking $L = 0$, $S = 0$.

Therefore, of the six possible pairs of $L$, $S$ states identified in Equations (13.17)-
(13.22), only three are actually allowed:

$$L = 2,\ S = 0$$

which can give only $J = 2$,

$$L = 0,\ S = 0$$

which can give only $J = 0$, and

$$L = 1,\ S = 1$$

which can give $J = 0$, 1, or 2.

Now Hund’s rules can be used to find which of these states is the ground
state. Hund’s Rule #1 instructs us to choose the state with the largest $S$, which is
$L = 1$, $S = 1$. Hund’s Rule #2 is irrelevant, since we have only a single $L$ value
($L = 1$) corresponding to $S = 1$. Finally, the $2p$ subshell can hold 6 electrons, and
it contains only 2 in this case, so it is less than half full. Thus, Hund’s Rule #3
indicates that the ground state is the lowest allowed $J$ state: $J = 0$. Therefore, the
ground state is $L = 1$, $S = 1$, $J = 0$, written as

$$^3P_0$$
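
The bookkeeping in Example 13.5 can be automated. The sketch below enumerates the allowed $(M_L, M_S)$ states for electrons in a single subshell and then peels off complete $L$, $S$ multiplets, starting from the largest $M_L$ that remains; this mirrors the puzzle-solving procedure of the example, but the function names and the peeling order are choices made here for illustration.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations

HALF = Fraction(1, 2)

def microstates(l, electrons):
    """Count the allowed (M_L, M_S) values for `electrons` electrons in one subshell.

    Choosing distinct single-electron (m_l, m_s) states enforces the exclusion
    principle; using combinations avoids double-counting exchanged electrons.
    """
    single = [(m_l, m_s) for m_l in range(l, -l - 1, -1) for m_s in (HALF, -HALF)]
    table = Counter()
    for combo in combinations(single, electrons):
        M_L = sum(m_l for m_l, _ in combo)
        M_S = sum(m_s for _, m_s in combo)
        table[(M_L, M_S)] += 1
    return table

def allowed_terms(l, electrons):
    """Peel off (L, S) multiplets from the table, largest remaining M_L first."""
    table = microstates(l, electrons)
    terms = []
    while sum(table.values()) > 0:
        remaining = [key for key, count in table.items() if count > 0]
        L = max(m_l for m_l, _ in remaining)
        S = max(m_s for m_l, m_s in remaining if m_l == L)
        terms.append((L, S))
        M_S = -S
        while M_S <= S:  # remove the (2L + 1)(2S + 1) states of this multiplet
            for M_L in range(-L, L + 1):
                table[(M_L, M_S)] -= 1
            M_S += 1
    return terms

print(sum(microstates(1, 2).values()))   # number of allowed two-electron states
print(allowed_terms(1, 2))               # (L, S) pairs for two p electrons
```

For two $p$ electrons this yields 15 states and the three pairs $L = 2, S = 0$; $L = 1, S = 1$; and $L = 0, S = 0$, in agreement with the example.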


Clearly, complex calculations of the sort given in Example 13.5 arise only when
there are two or more electrons in an unfilled subshell. Atoms in which all of the
subshells are filled have $L = 0$, $S = 0$, and $J = 0$. Note that this includes atoms
which are not chemically “inert”, since an atom can have all of its subshells filled
but still have an unfilled shell. For instance, beryllium (with four electrons) has
the electron configuration

$$1s^2\, 2s^2$$

Both of its subshells are filled, giving a ground-state angular momentum of $^1S_0$.
However, the $n = 2$ shell is unfilled (it can hold 8 electrons) so beryllium is quite
chemically reactive. Similarly, any atom with a single electron in an unfilled
subshell will have the angular momentum quantum numbers of that electron, as in the
case of boron considered above.

We have barely scratched the surface of the study of multielectron atoms. Atomic
physics remains an active area of research with many problems still being explored.
However, almost this entire field traces back to fundamental concepts from
quantum mechanics.





EXERCISES 

13.1 Consider the two-particle Hamiltonian given by 

Show that the exchange operator commutes with $H$ as long as the two-particle
potential has the property that $V(\mathbf{r}_1, \mathbf{r}_2) = V(\mathbf{r}_2, \mathbf{r}_1)$.

13.2 Two identical spin-1/2 particles with mass $m$ are in a one-dimensional infinite square-
well potential with width $a$, so $V(x) = 0$ for $0 < x < a$, and there are infinite
potential barriers at $x = 0$ and $x = a$. The particles do not interact with each other; they
see only the infinite square-well potential.

(a) Calculate the energies of the three lowest-energy singlet states. 

(b) Calculate the energies of the three lowest-energy triplet states. 

(c) Suppose that the particles are in a state with wave function

$$\psi(x_1, x_2) = \frac{1}{\sqrt{2}}\,\frac{2}{a}\left[\sin\frac{\pi x_1}{a}\sin\frac{2\pi x_2}{a} + \sin\frac{\pi x_2}{a}\sin\frac{2\pi x_1}{a}\right]$$

where $x_1$ is the position of particle 1 and $x_2$ is the position of particle 2. Are the
particles in a triplet spin state or a singlet spin state? Explain.

13.3 Two identical spin-0 particles with mass $m$ are confined inside a three-dimensional
rectangular box given by $0 < x < a$, $0 < y < b$, and $0 < z < c$, where $a < b < c$,
and the potential barriers at the walls of the box are infinitely high. The particles do
not interact with each other; they see only the potential of the box. Write down the
normalized wave function $\psi(x_1, y_1, z_1, x_2, y_2, z_2)$ for the ground state, and indicate
the energy of the ground state.

13.4 (a) Two identical spin-1/2 particles are confined inside of the rectangular box from
Exercise 13.3. The particles do not interact with each other; they see only the
potential of the box. Write down the normalized spatial part of the lowest-energy
singlet wave function. What is the energy of this state?

(b) Write down the normalized spatial part of the lowest-energy triplet wave
function. What is the energy of this state?

(c) What is the total spin s of the ground-state wave function for the system? 

13.5 Two identical spin-1/2 particles are confined to an infinite one-dimensional square
well of width $a$ with infinite potential barriers at $x = 0$ and $x = a$. The potential is
$V(x) = 0$ for $0 < x < a$. Suppose that the particles interact weakly by the potential
$V_1(x) = K\delta(x_1 - x_2)$, where $x_1$ and $x_2$ are the positions of the two particles, $K$ is a
constant, and $\delta$ is the Dirac delta function. This represents a very short-range weak
force between the two particles.

(a) Using first-order perturbation theory, find the perturbation to the energy of the 
lowest-energy singlet state. 

(b) Show that the first-order perturbation to the energy of the lowest-energy triplet 
state is zero. 

(c) What is the physical reason for the answer in part (b)? 





13.6 Two identical spin-1/2 particles are in the one-dimensional simple harmonic
oscillator potential $V(x) = (1/2)Kx^2$. The particles do not interact with each other; they
see only the harmonic oscillator potential. The particles are in the lowest-energy
triplet state ($s = 1$).

(a) Write down the normalized spatial part of the wave function. 

(b) Calculate the energy of this state. 

(c) If the positions of both particles are measured, what is the probability that both 
particles will be located on the right-hand side of the minimum in the potential 
(i.e., the probability that both particles have x > 0)? 

13.7 Show that the $n$ shell in an atom can hold $2n^2$ electrons.

13.8 Sodium has Z = 11. Determine the ground-state electron configuration. 

In Exercises 13.9-13.12, express all angular momentum states in the notation $^{2S+1}L_J$.

13.9 Determine the ground-state $L$, $S$, and $J$ values for

(a) calcium which has the electron configuration

$$1s^2\, 2s^2\, 2p^6\, 3s^2\, 3p^6\, 4s^2$$

(b) yttrium which has the electron configuration

$$1s^2\, 2s^2\, 2p^6\, 3s^2\, 3p^6\, 3d^{10}\, 4s^2\, 4p^6\, 4d^1\, 5s^2$$

13.10 Consider the excited state of beryllium with the electron configuration 

$1s^2\,3p^1\,3d^1$

Determine all possible L, S , and J values. Note that the Pauli exclusion principle 
for the two n = 3 electrons can be ignored; why? 

13.11 Zirconium (Z = 40) consists of closed subshells plus 2 electrons in an unfilled d
subshell. Derive the set of allowed L, S, and J values, and determine which state
has the lowest energy. 

13.12 The electron configuration for nitrogen is 

$1s^2\,2s^2\,2p^3$


Calculate L, S, and J for the ground state. 



CHAPTER

14



Some Modern Applications of 
Quantum Mechanics 


The first half of the 20th century saw two revolutions in physics, culminating in the
theories of quantum mechanics and general relativity. Of these two, quantum mechanics
has certainly had a more profound impact on the subsequent development
of physics. Quantum mechanics provides the basis for nuclear physics, particle
physics, and solid state physics, with applications in almost all other fields of
physics. Furthermore, it has yielded a surprising number of practical applications.
In this chapter we examine just two of these, in which the use of quantum mechanics
is particularly straightforward: magnetic resonance imaging and quantum
computing. Magnetic resonance imaging is already a well-developed technology,
while quantum computing is still very much in the formative stage. These are
presented merely as representative examples; there are many others.


14.1 ■ MAGNETIC RESONANCE IMAGING 


Imagine that a patient needs to undergo a magnetic resonance imaging (MRI) scan 
(Figure 14.1). The patient first removes any objects that can affect or be affected 
by magnetic fields (jewelry, credit cards, metallic objects), and then lies on a 
padded bench which slides into a long tube. The only sounds are occasional loud 
thumping noises. After 30-90 minutes, the patient slides back out again, and the 
exam is finished. 

The development of magnetic resonance imaging technology represents an
enormous advancement in diagnostic medicine. MRI scans are among the least
invasive and safest of all medical tests. Unlike X-rays, they appear to have absolutely
no harmful effects, and they allow physicians to image soft tissues, especially
brain and nerve tissue, which are difficult to examine with conventional X-rays.
Real-time imaging of the brain has opened up new areas of research in neuroscience,
as investigators are able to see different parts of the brain “light up” in
response to different stimuli. And it’s all derived from quantum mechanics.

Magnetic resonance imaging involves the interaction of external magnetic fields 
with the protons in hydrogen atoms. Consider a hydrogen atom placed in a strong 
magnetic field (Figure 14.2). The proton magnetic moment $\boldsymbol{\mu}_p$ is given by an
expression similar to that for the electron magnetic moment in Equation (8.7):

$$\boldsymbol{\mu}_p = \frac{g_p\mu_B}{\hbar}\,\frac{m_e}{m_p}\,\mathbf{S} \tag{14.1}$$





Courtesy of John Gore, Vanderbilt University 

FIGURE 14.1 A magnetic resonance imaging (MRI) machine. 




FIGURE 14.2 The protons in hydrogen placed in a magnetic field $\mathbf{B}_0 = B_0\hat{z}$ will line up
parallel or antiparallel to the field.


where $g_p = 5.59$ is determined experimentally. The factor multiplying $\mathbf{S}$ is positive
in this equation and negative in Equation (8.7) because the proton and electron have
opposite charges, and the ratio of $m_e$ to $m_p$ in Equation (14.1) arises from the fact
that the magnetic moment is inversely proportional to the mass of the particle
(Equation 8.5). As before, $\mu_B$ is the Bohr magneton, $\mu_B = 9.3 \times 10^{-24}$ A·m$^2$, and
$\mathbf{S}$ is the spin.

Recall from Chapter 8 that the potential experienced by the proton in an external 
magnetic field B is 


$$V = -\boldsymbol{\mu}_p \cdot \mathbf{B}$$











FIGURE 14.3 An oscillating magnetic field in the x direction causes some of the proton 
spins to flip. 


If, for instance, $\mathbf{B}$ is a static field in the $z$ direction with magnitude $B_0$,

$$\mathbf{B}_0 = B_0\hat{z}$$

then the protons will be forced into eigenstates of the Hamiltonian and will line up
with spins either in the $+z$ direction with energy $E = -(g_p\mu_B/2)(m_e/m_p)B_0$, or
in the $-z$ direction with energy $E = (g_p\mu_B/2)(m_e/m_p)B_0$ (Figure 14.2).

Now suppose that we add a perturbation in the form of an electromagnetic 
wave. The magnetic field generated by this wave will have the form $B_1\cos(\omega t)$.
This applied magnetic field can be chosen to be perpendicular to the $z$-axis; for
simplicity we will take it to be in the $x$ direction, so that

$$\mathbf{B}_1 = B_1\cos(\omega t)\,\hat{x}$$

What happens to the proton spins when this oscillating magnetic field is applied? 
This problem was previously discussed in Example 11.2 for the case of an electron. 
It was found that the applied field produces a nonzero probability for the electron to 
flip into the opposite z spin state (Figure 14.3). Using the results of Example 11.2, 
but replacing the electron magnetic moment with the proton magnetic moment, 
gives 


$$P(i \to f) = \frac{B_1^2\mu_p^2}{4\hbar^2}\,\frac{\sin^2[(\omega - \omega_0)t/2]}{[(\omega - \omega_0)/2]^2} \tag{14.2}$$

where $P(i \to f)$ is the transition probability for a proton to go from a lower energy
(spin up) state to a higher energy (spin down) state. The frequency $\omega_0$ in this case
is

$$\omega_0 = 2\mu_B(g_p/2)(m_e/m_p)B_0/\hbar$$

The important point is that the transition probability in Equation (14.2) is sharply
peaked around $\omega \approx \omega_0$. In order to drive a large number of transitions, the applied






FIGURE 14.4 As the protons relax back into the lower-energy spin state, they precess 
about the static magnetic field. 


frequency of the radiation should be close to this resonance frequency. A typical
MRI field strength is $B_0 = 1.5$ T; using this value, we obtain $\omega_0 = 4.0 \times 10^8$ sec$^{-1}$,
corresponding to a frequency of

$$\nu = \omega/2\pi = 6.4 \times 10^7 \text{ Hz}$$

Thus, the applied frequency needs to be in the MHz region, corresponding to radio
frequencies.
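
The numbers quoted above can be reproduced directly from the expression for the resonance frequency. The short Python sketch below does so. Apart from $g_p = 5.59$ and $B_0 = 1.5$ T, the constants are standard rounded SI values supplied here for illustration, and the function name `proton_resonance` is hypothetical.

```python
import math

# Rounded SI constants (supplied for this sketch, not taken from the text).
HBAR = 1.0546e-34   # reduced Planck constant, J s
MU_B = 9.274e-24    # Bohr magneton, A m^2
M_E = 9.109e-31     # electron mass, kg
M_P = 1.6726e-27    # proton mass, kg
G_P = 5.59          # proton g-factor quoted in the text


def proton_resonance(b0_tesla):
    """Return (omega0, nu0) for a static field of b0_tesla.

    omega0 = 2 mu_B (g_p / 2) (m_e / m_p) B_0 / hbar, and nu0 = omega0 / (2 pi).
    """
    omega0 = 2.0 * MU_B * (G_P / 2.0) * (M_E / M_P) * b0_tesla / HBAR
    return omega0, omega0 / (2.0 * math.pi)


omega0, nu0 = proton_resonance(1.5)
print(f"omega0 = {omega0:.2e} rad/s, nu0 = {nu0:.2e} Hz")
```

For a 1.5 T field this gives an angular frequency of about $4.0 \times 10^8$ sec$^{-1}$ and a frequency of about 64 MHz, in agreement with the values in the text.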

After the spins flip into the high-energy state, they will relax back into the low-energy
state. As they do so, they will develop a spin component (and therefore
a magnetic moment component) perpendicular to the strong static magnetic field
$B_0\hat{z}$, causing them to precess about this field (Figure 14.4). We examined spin
precession in Chapter 8, where we found that the angular frequency of precession
for an electron with spin perpendicular to the magnetic field $\mathbf{B}$ was $\omega = 2\mu_B B/\hbar$.
Again, we must change the electron magnetic moment $\mu_B$ to the proton magnetic
moment $\mu_p$, so that the precession frequency for the protons is

$$\omega = 2\mu_p B_0/\hbar = 2\mu_B(g_p/2)(m_e/m_p)B_0/\hbar$$

In fact, this is exactly the same as the applied frequency that maximized the probability
for the protons to flip! As the protons precess, they emit radiation at the
precession frequency. As noted, this frequency is in the MHz range, so that the precessing
protons give off electromagnetic radiation at radio frequencies. This radiation
can then be detected and used to map the emitting protons. Thus, MRI allows
the direct imaging of hydrogen atoms in the body (Figure 14.5). Tuning MRI to detect
hydrogen atoms makes sense, since hydrogen is the most abundant element (by
number) in the human body. (This picture is an oversimplification, since the protons
actually precess about the combined total magnetic field, $\mathbf{B} = \mathbf{B}_0 + \mathbf{B}_1\cos(\omega t)$,
but it gives a reasonably accurate picture of how MRI works.)






Courtesy of the Department of Diagnostic Radiology, Yale University School of Medicine 
FIGURE 14.5 An example of the image produced by an MRI machine. 


MRI has several advantages over X-rays. Perhaps the biggest advantage is one 
of safety: X-rays ionize atoms in the body, leading to the danger of cancer at low 
doses and actual destruction of tissue at high doses. In contrast, no health risk from 
strong magnetic fields has ever been demonstrated. MRI images soft tissues more 
effectively than X-rays, which scatter less effectively off of low-density tissues 
than off of high-density materials such as bone. Furthermore, since MRI couples 
to hydrogen atoms, it can actually provide information on the chemical content of 
body tissues. The major drawback of MRI (compared with conventional X-rays) 
is its high cost. 

It is amusing to note that MRI was originally (and accurately) called “nuclear 
magnetic resonance,” since it relies on the resonant flip of the atomic nuclei in 
magnetic fields, with subsequent precession and re-emission of radiation at the 
resonant frequency. However, patients were disturbed by the use of the word “nuclear,”
which conjured images of nuclear weapons, nuclear waste, etc. This was
particularly unfortunate given that the process itself is so noninvasive and benign.
Hence it was repackaged as “magnetic resonance imaging,” avoiding the negative
connotations of the word “nuclear.”





14.2 ■ QUANTUM COMPUTING 

We now examine a more recent, and completely different, application of quantum 
mechanics: quantum computing. Classical computing is based on binary digits or 
bits. For instance, a two-bit computer can be in one of the following four states: 

0 0 
0 1 
1 0 
1 1 

where 0 and 1 can represent, electronically, circuits which are “on” or “off” or,
logically, states that are “true” or “false.” A sequence of $n$ bits can be in $2^n$ possible
states; this provides a measure of the information storage in such a system. Note, 
however, that the system can be in only one of these states at any given time, so 
the computer can operate on only one state at a time. 

Now consider a quantum analog: a system of two spin-1/2 particles. Just as in 
the case of the two-bit circuit, each particle can be in one of two states, so there 
are four possible states all together: 


$|\downarrow\downarrow\rangle$

$|\downarrow\uparrow\rangle$

$|\uparrow\downarrow\rangle$

$|\uparrow\uparrow\rangle$

Now “spin down” and “spin up” represent the “0” and “1” states, respectively, of a 
classical computer. These quantum bits are called qubits. Note, however, that the 
system need not be in a single state; it can be in a superposition of all four states, 
e.g., 


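$$|\psi\rangle = a_1|\downarrow\downarrow\rangle + a_2|\downarrow\uparrow\rangle + a_3|\uparrow\downarrow\rangle + a_4|\uparrow\uparrow\rangle$$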

It is possible then to access all four of these spin states simultaneously, since
each of them contributes to $|\psi\rangle$. One can also imagine operating on this linear
combination of spin states to obtain a second linear combination of the spin states
for the two particles. This represents a radical degree of parallel computing: all
four two-qubit states can be manipulated simultaneously. This parallelism becomes
more pronounced as the number of qubits increases; with twenty qubits, for example,
about a million ($2^{20}$) spin states can be accessed simultaneously!
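
A two-qubit superposition can be pictured as nothing more than a list of four complex amplitudes. The Python sketch below is a minimal illustration under that assumption. The basis ordering (00, 01, 10, 11, with spin down as 0 and spin up as 1) and the helper names `normalize` and `probabilities` are choices made here, not notation from the text.

```python
import math

# Assumed basis ordering: |down down>, |down up>, |up down>, |up up>,
# labeled by the corresponding classical bit strings.
LABELS = ["00", "01", "10", "11"]


def normalize(amplitudes):
    """Scale the amplitudes so that the probabilities sum to 1."""
    norm = math.sqrt(sum(abs(a) ** 2 for a in amplitudes))
    return [a / norm for a in amplitudes]


def probabilities(amplitudes):
    """Probability |a_i|^2 of finding the system in each basis state."""
    return [abs(a) ** 2 for a in amplitudes]


# An equal-weight superposition of all four two-qubit states.
psi = normalize([1, 1, 1, 1])
for label, p in zip(LABELS, probabilities(psi)):
    print(label, round(p, 3))

# The number of basis states available to n qubits grows as 2**n.
for n in (2, 10, 20):
    print(n, "qubits:", 2 ** n, "basis states")
```

A classical two-bit register would occupy exactly one of the four labels at a time; here all four amplitudes are present at once, and any operation on `psi` acts on all of them together.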

It is possible to carry this analogy further and actually perform logical operations
on the qubits. First, recall the sort of logic gates that are possible in a classical
computer. Binary gates such as AND and OR take two bits and produce a single
output bit (Figure 14.6). If 1 and 0 represent “true” and “false,” respectively, then
the AND gate produces an output of 1 if and only if both input bits are 1; otherwise 
it produces 0. Similarly, the OR gate produces 1 if either bit is 1; it produces 0 






FIGURE 14.6 The classical AND, OR, and XOR gates take in two bits and produce a 
one-bit output. 


only if both bits are 0. A third example is the XOR, or exclusive OR gate, which 
is identical to the OR gate with the exception that it produces a “false” output if 
both inputs are “true.”
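
The three classical gates just described can be tabulated in a few lines. The Python sketch below builds their truth tables using the bitwise operators `&`, `|`, and `^`; the function and variable names are illustrative only.

```python
from itertools import product


def gate_and(a, b):
    """1 only if both inputs are 1."""
    return a & b


def gate_or(a, b):
    """1 if either input is 1."""
    return a | b


def gate_xor(a, b):
    """1 if exactly one input is 1."""
    return a ^ b


# Each truth table maps an input pair (a, b) to the single output bit.
tables = {
    name: {(a, b): gate(a, b) for a, b in product((0, 1), repeat=2)}
    for name, gate in (("AND", gate_and), ("OR", gate_or), ("XOR", gate_xor))
}

for name, table in tables.items():
    print(name)
    for (a, b), out in table.items():
        print(f"  {a} {b} -> {out}")
```

Each table has four input rows but only two possible output values, which is the sense in which a gate with two inputs and one output discards information.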

In order to produce the quantum analog of these logic gates, we first need a 
slightly different representation of our four two-particle spin states. We represent 
the four states as column vectors in the following way: 


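$$|\downarrow\downarrow\rangle \Leftrightarrow \begin{pmatrix} 1 \\ 0 \\ 0 \\ 0 \end{pmatrix} \tag{14.3}$$

$$|\downarrow\uparrow\rangle \Leftrightarrow \begin{pmatrix} 0 \\ 1 \\ 0 \\ 0 \end{pmatrix} \tag{14.4}$$

$$|\uparrow\downarrow\rangle \Leftrightarrow \begin{pmatrix} 0 \\ 0 \\ 1 \\ 0 \end{pmatrix} \tag{14.5}$$

$$|\uparrow\uparrow\rangle \Leftrightarrow \begin{pmatrix} 0 \\ 0 \\ 0 \\ 1 \end{pmatrix} \tag{14.6}$$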


As noted, the spin-down state represents a “0” or “false” bit, and the spin-up 
state represents a “1” or “true” state. A logic gate can then be represented as 






multiplication by a 4 x 4 matrix. As an example, the matrix representing the XOR 
gate is 


$$U_{\rm XOR} = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \end{pmatrix}$$


Using this matrix representation as well as the column vector representations in 
Equations (14.3)—(14.6), we can derive how Uxor operates on our four spin states: 


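$$\begin{aligned}
U_{\rm XOR}|\downarrow\downarrow\rangle &= |\downarrow\downarrow\rangle \\
U_{\rm XOR}|\downarrow\uparrow\rangle &= |\downarrow\uparrow\rangle \\
U_{\rm XOR}|\uparrow\downarrow\rangle &= |\uparrow\uparrow\rangle \\
U_{\rm XOR}|\uparrow\uparrow\rangle &= |\uparrow\downarrow\rangle
\end{aligned} \tag{14.7}$$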


In what sense can this result be treated as an XOR logic gate? The two inputs are 
simply the spin states of the two particles. The output of the logic gate is read off 
of the spin state of the second particle, while the spin state of the first particle is left 
unchanged. For instance, in Equation (14.7), the input spins of the two particles 
represent “true” and “false,” respectively, so the XOR output should be “true.” 
Hence, the spin of the second particle is set to spin up, while the spin of the first 
particle remains unchanged by the operator. Clearly, this differs from a classical 
logic gate in that there are two outputs rather than one: the spin of the first particle 
(which never changes) and the spin of the second particle (which corresponds to the 
output of the classical XOR gate). There is a reason for this: quantum mechanics 
is invariant under time reversal, so any quantum mechanical operation must be 
reversible. An operation such as a classical logic gate, which takes two inputs and 
produces only a single output, reduces the total information in the system; one 
cannot, in general, reconstruct the input values that go into a classical logic gate if 
only the output is known. Hence, quantum mechanical logic gates must have two 
outputs. Note, as emphasized earlier, that the quantum XOR gate can operate on a 
linear superposition of spin states, producing a linear superposition of outputs. 
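
The action of the quantum XOR gate can be checked with a few lines of matrix arithmetic. The Python sketch below assumes that the four basis states are ordered as down-down, down-up, up-down, up-up (written `dd`, `du`, `ud`, `uu`), and it uses plain lists rather than a linear-algebra library. The helper names are illustrative, and the amplitudes in `psi_in` are arbitrary values chosen for the example.

```python
# Assumed basis ordering: index 0..3 = |dd>, |du>, |ud>, |uu>,
# where d = spin down ("false") and u = spin up ("true").
LABELS = ["dd", "du", "ud", "uu"]

U_XOR = [
    [1, 0, 0, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 1],
    [0, 0, 1, 0],
]


def apply(matrix, vector):
    """Multiply a square matrix (list of rows) by a column vector (list)."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


def basis_vector(index, size=4):
    """Column vector with a 1 in position `index` and 0 elsewhere."""
    return [1 if k == index else 0 for k in range(size)]


# Action on the four basis states: the first spin is unchanged and the
# second spin becomes XOR(first, second).
outputs = {}
for i, label in enumerate(LABELS):
    result = apply(U_XOR, basis_vector(i))
    outputs[label] = LABELS[result.index(1)]
    print(f"|{label}> -> |{outputs[label]}>")

# All four outputs are different, so the input can always be recovered.
print("distinct outputs:", len(set(outputs.values())))

# Action on a superposition: the gate acts on every amplitude at once.
psi_in = [0.5, 0.5j, -0.5, 0.5]
psi_out = apply(U_XOR, psi_in)
print(psi_out)
```

Because the gate is linear, the output amplitudes are simply the input amplitudes with the last two exchanged; no separate pass over each basis state is needed.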

It is also possible to produce quantum logic circuits with no classical analog. 
Consider, instead of a two-qubit system, a single-qubit system with the two possible 
states 


$$|\uparrow\rangle \Leftrightarrow \text{TRUE}$$

$$|\downarrow\rangle \Leftrightarrow \text{FALSE}$$

The NOT gate gives an output of TRUE if the input is FALSE and FALSE if the 





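input is TRUE. In matrix form, this operator can be written as

$$U_{\rm NOT} = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$$

The NOT gate is a well-behaved classical logic gate. Now, however, consider the
$\sqrt{\rm NOT}$ gate, given by

$$U_{\sqrt{\rm NOT}} = \frac{1}{2}\begin{pmatrix} 1-i & 1+i \\ 1+i & 1-i \end{pmatrix}$$

It is possible to show (Exercise 14.7) that $U_{\sqrt{\rm NOT}}\,U_{\sqrt{\rm NOT}} = U_{\rm NOT}$, so that applying
the operation $\sqrt{\rm NOT}$ twice yields NOT. There is, however, no classical logic gate
with this property.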
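
A numerical check is no substitute for the algebra, but it does make the claim concrete. The Python sketch below assumes the matrix forms $U_{\rm NOT} = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$ and $U_{\sqrt{\rm NOT}} = \frac{1}{2}\begin{pmatrix} 1-i & 1+i \\ 1+i & 1-i \end{pmatrix}$, squares the second, and compares it with the first. It also applies the gate once to a definite input state. The helper names are illustrative.

```python
def matmul(a, b):
    """Product of two square matrices stored as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def apply(matrix, vector):
    """Multiply a square matrix by a column vector."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


U_NOT = [[0, 1], [1, 0]]
U_SQRT_NOT = [[(1 - 1j) / 2, (1 + 1j) / 2],
              [(1 + 1j) / 2, (1 - 1j) / 2]]

# Applying the square-root gate twice should reproduce the NOT gate.
squared = matmul(U_SQRT_NOT, U_SQRT_NOT)
max_error = max(abs(squared[i][j] - U_NOT[i][j])
                for i in range(2) for j in range(2))
print("largest deviation from U_NOT:", max_error)

# A single application to a definite input gives a superposition in which
# both outcomes are equally likely.
state = apply(U_SQRT_NOT, [1, 0])
outcome_probabilities = [abs(a) ** 2 for a in state]
print(outcome_probabilities)
```

The single application leaves the qubit in an equal-weight superposition of the two basis states, which is why no classical gate acting on definite bit values can play the same role.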

This all sounds fine in theory, but is it actually possible to design a quantum 
computing system to perform useful calculations? A breakthrough in this area was 
achieved in 1994 at Bell Labs by Peter Shor, who devised a quantum algorithm for 
factoring large integers into primes. This algorithm was successfully implemented in 2001
at the IBM Almaden Research Center. The IBM scientists constructed a molecule 
consisting of five fluorine-19 atoms and two carbon-13 atoms, with the spins of the 
atomic nuclei serving as the qubits. These nuclear spins were manipulated using 
nuclear magnetic resonance technology similar to that described in the previous 
section. This quantum computer succeeded in factoring the number 15 (into 3 and 
5). Obviously, quantum computing has a way to go before it becomes competitive 
with classical computers! However, it is an area of intense current research. 


EXERCISES 

14.1 Oxygen is the second most abundant element in the human body (by number) and the
most abundant by mass. However, MRI detection of oxygen atoms is not practical.
Why? 

14.2 Electrons in hydrogen have a much larger magnetic moment (both orbital and spin) 
than the magnetic moment of the proton. Why then are MRI machines not tuned,
for example, to the resonant frequency of the spin magnetic moment of the electron 
rather than the proton? 

14.3 Consider an MRI machine with a 1.5 T static field. How far from the resonant
frequency would one have to be in order for the spin-flip probability to decrease 
from its maximum value to each of the following? 

(a) one-half of the maximum value 

(b) zero 

14.4 The random thermal energy of each molecule in the human body is roughly $E \approx kT$,
where $k$ is Boltzmann’s constant ($k = 1.38 \times 10^{-23}$ J K$^{-1}$) and $T$ is the temperature.
Compare this thermal energy to the potential energy experienced by a proton in a
1.5 T MRI machine. What does the answer say about the efficiency with which
protons will align into lower energy states in such a magnetic field? 





14.5 (a) Verify that the matrix corresponding to $U_{\rm NOT}$ produces the correct output when
applied to the $|\uparrow\rangle$ and $|\downarrow\rangle$ states.

(b) Compute the matrix corresponding to $U_{\rm NOT}^2$, and calculate the result when it is
applied to $|\downarrow\rangle$ and $|\uparrow\rangle$. What logical operation does $U_{\rm NOT}^2$ correspond to?

14.6 (a) Show explicitly that the classical XOR gate cannot be inverted. 

(b) Calculate the matrix corresponding to the inverse of the quantum XOR gate. 

14.7 Show that $U_{\sqrt{\rm NOT}}^2 = U_{\rm NOT}$.

14.8 Apply $U_{\rm XOR}$ to the mixed state $(1/\sqrt{2})(|\downarrow\downarrow\rangle + |\uparrow\uparrow\rangle)$. Explain what the result
means. 



CHAPTER

15


What Comes Next? Relativistic 
Quantum Mechanics 



The theory of quantum mechanics that we have developed thus far is based on the
nonrelativistic definition of energy, namely,

$$E = \frac{p^2}{2m} + V \tag{15.1}$$

The replacement of $p$, $V$, and $E$ with the appropriate operators leads to the
Schrödinger equation. However, special relativity, which predates the Schrödinger
equation by 20 years, indicates that Equation (15.1) is only an approximation valid
at low velocities. When particle velocities become comparable to the speed of light, 
this equation breaks down. In this chapter we will examine what happens when 
we attempt to incorporate special relativity into quantum mechanics; the result is 
called relativistic quantum mechanics. 


15.1 ■ THE KLEIN-GORDON EQUATION


Derivation of the Klein-Gordon Equation 

As noted, the relationship between momentum and energy given by Equation (15.1) 
is valid only in the limit of low velocities, $v \ll c$, where $c$ is the speed of light.
In special relativity, Einstein generalized this equation to give the correct relation
between $p$ and $E$ at all velocities:

$$E^2 = p^2c^2 + m^2c^4 \tag{15.2}$$

where we have assumed a free particle with no potential (we will assume $V = 0$
throughout this chapter), and $m$ is the mass of the particle at rest, which is a constant.
(In dealing with relativistic quantities, it is possible to simplify the equations
considerably by setting $c = 1$. We will resist the urge to do that here, but it is
important to be aware that it is often done.) In the limit where $v \ll c$, it is possible
to show (Exercise 15.1) that Equation (15.2) reduces to


$$E = \frac{p^2}{2m} + mc^2 \tag{15.3}$$

This equation is similar to Equation (15.1) for the case $V = 0$, but there is an extra
term on the right-hand side of Equation (15.3), corresponding to an extra contribution
to the energy: $E = mc^2$. In special relativity, this is called the rest energy




of the particle, and it must be included in the total energy. Note that nonrelativistic 
quantum mechanics ignores this rest energy, but it is included in the equations of 
relativistic quantum mechanics. 
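
How good the low-velocity form of the energy is can be seen numerically. The Python sketch below compares the positive root of $E^2 = p^2c^2 + m^2c^4$ with $E = p^2/2m + mc^2$ for several values of the dimensionless ratio $p/(mc)$. The electron mass is used only as an example, the constants are rounded SI values supplied here, and the function names are illustrative.

```python
import math

C = 2.998e8      # speed of light, m/s (rounded)
M = 9.109e-31    # particle mass, kg (electron mass, chosen as an example)


def energy_exact(p, m=M):
    """Positive root of E^2 = p^2 c^2 + m^2 c^4."""
    return math.sqrt((p * C) ** 2 + (m * C ** 2) ** 2)


def energy_low_velocity(p, m=M):
    """Low-velocity form E = p^2 / (2m) + m c^2."""
    return p ** 2 / (2 * m) + m * C ** 2


def relative_error(p, m=M):
    """Fractional difference between the low-velocity and exact energies."""
    exact = energy_exact(p, m)
    return abs(energy_low_velocity(p, m) - exact) / exact


for ratio in (0.001, 0.01, 0.1, 1.0):
    p = ratio * M * C
    print(f"p/(mc) = {ratio:<6} relative error = {relative_error(p):.2e}")
```

The error is negligible when $p \ll mc$ and grows to several percent once $p$ is comparable to $mc$, which is where the nonrelativistic expression stops being a reasonable approximation.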

In order to derive an equation for the wave function that corresponds to Equation
(15.2), we follow a procedure very similar to our “derivation” of the Schrödinger
equation in Chapter 3. Assume that we have a wave function $\phi(\mathbf{r}, t)$ that is an
eigenfunction of the energy operator $i\hbar(\partial/\partial t)$ with eigenvalue $E$, and also an
eigenfunction of the momentum operator $-i\hbar\nabla$ with eigenvalue $\mathbf{p}$. In that case,
we have

$$\left(i\hbar\frac{\partial}{\partial t}\right)^2\phi = E^2\phi$$

and

$$(-i\hbar\nabla)^2\phi = p^2\phi$$

and we can reproduce Equation (15.2) by writing

$$\left(i\hbar\frac{\partial}{\partial t}\right)^2\phi = (-i\hbar\nabla)^2c^2\phi + m^2c^4\phi \tag{15.4}$$

As in the derivation of the Schrödinger equation, we now make the assumption that
Equation (15.4) is always valid, regardless of whether or not $\phi$ is an eigenfunction
of energy or momentum. Simplifying Equation (15.4) gives

$$\frac{1}{c^2}\frac{\partial^2\phi}{\partial t^2} - \nabla^2\phi + \frac{m^2c^2}{\hbar^2}\phi = 0 \tag{15.5}$$


Equation (15.5) is called the Klein-Gordon equation. 
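
As a consistency check, a plane wave can be substituted into the Klein-Gordon equation, taken here in the form $(1/c^2)\,\partial^2\phi/\partial t^2 - \nabla^2\phi + (m^2c^2/\hbar^2)\phi = 0$. The worked steps below assume a trial solution with amplitude $A$, momentum $\mathbf{p}$, and energy $E$; the amplitude $A$ is a symbol introduced only for this illustration.

```latex
\begin{align*}
\phi &= A\,e^{i(\mathbf{p}\cdot\mathbf{r} - Et)/\hbar} \\
\frac{\partial^2\phi}{\partial t^2} &= -\frac{E^2}{\hbar^2}\,\phi,
\qquad
\nabla^2\phi = -\frac{p^2}{\hbar^2}\,\phi \\
\frac{1}{c^2}\frac{\partial^2\phi}{\partial t^2} - \nabla^2\phi + \frac{m^2c^2}{\hbar^2}\,\phi
&= \frac{1}{\hbar^2c^2}\left(-E^2 + p^2c^2 + m^2c^4\right)\phi = 0
\end{align*}
```

The plane wave is therefore a solution exactly when $E^2 = p^2c^2 + m^2c^4$, which is the relativistic energy-momentum relation of Equation (15.2).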

The Klein-Gordon equation describes a particle with spin 0, which limits its 
usefulness, since the particles of greatest interest (e.g., the electron, proton, etc.) 
all have spin 1/2. Further, this equation leads to some problems connected with
the interpretation of probabilities. To see this we need to digress. 


Probability Densities and Currents 

For the Schrödinger equation, the probability density is given by $|\phi|^2$. However, it
is not true that the corresponding quantity for the Klein-Gordon equation is $|\phi|^2$.
To find the probability density in this case, we need to introduce a new quantity 
called the probability current. 

We argue in analogy to a classical fluid. For a fluid with density $\rho$ and velocity
$\mathbf{v}$, the rate $\partial\rho/\partial t$ at which the density changes at a fixed point is given by

$$\frac{\partial\rho}{\partial t} + \nabla\cdot(\rho\mathbf{v}) = 0 \tag{15.6}$$






The validity of this equation can be seen by integrating it over a closed volume
$V$, and using the divergence theorem to transform the integral of $\nabla\cdot(\rho\mathbf{v})$ into an
integral over the surface:

$$\int_V \frac{\partial\rho}{\partial t}\,dV + \oint_A \rho\mathbf{v}\cdot d\mathbf{A} = 0 \tag{15.7}$$

The first term in this equation is just the rate at which the total mass inside the 
closed surface changes; the second term is the rate at which mass is crossing the 
surface. Thus, Equation (15.7) (or equivalently, Equation 15.6) simply says that 
the rate at which mass increases or decreases inside of a bounded region (the first 
term) is just given by the rate at which mass crosses the boundary of the region 
(the second term). Therefore, Equation (15.6) is called the continuity equation. 

In quantum mechanics, probability can be treated in exactly the same way. The
quantity equivalent to the density $\rho$ is just the probability density (which for the
Schrödinger equation is $\rho = |\psi|^2$), and we can define a probability current $\mathbf{J}$ that
satisfies the continuity equation for probabilities:

$$\frac{\partial\rho}{\partial t} + \nabla\cdot\mathbf{J} = 0 \tag{15.8}$$

As an example, we determine the expression corresponding to $\mathbf{J}$ for the Schrödinger
wave function.


Example 15.1. The Nonrelativistic Probability Current. 

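What is the value for $\mathbf{J}$ arising from the nonrelativistic Schrödinger equation?
Substituting $\rho = \psi^*\psi$ into Equation (15.8) gives

$$\psi^*\frac{\partial\psi}{\partial t} + \psi\frac{\partial\psi^*}{\partial t} + \nabla\cdot\mathbf{J} = 0 \tag{15.9}$$

The first two terms can be simplified using the nonrelativistic Schrödinger equation,
which can be written, in the absence of a potential, as

$$i\hbar\frac{\partial\psi}{\partial t} = -\frac{\hbar^2}{2m}\nabla^2\psi$$

Multiplying by $-i\psi^*/\hbar$ gives the first term in Equation (15.9):

$$\psi^*\frac{\partial\psi}{\partial t} = \frac{i\hbar}{2m}\psi^*\nabla^2\psi$$

and the complex conjugate of this equation gives the second term in Equation
(15.9):

$$\psi\frac{\partial\psi^*}{\partial t} = -\frac{i\hbar}{2m}\psi\nabla^2\psi^*$$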






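Substituting these expressions into Equation (15.9) gives

$$\nabla\cdot\mathbf{J} = \frac{i\hbar}{2m}\left(\psi\nabla^2\psi^* - \psi^*\nabla^2\psi\right)$$

which can be integrated to give the expression for $\mathbf{J}$:

$$\mathbf{J} = \frac{i\hbar}{2m}\left(\psi\nabla\psi^* - \psi^*\nabla\psi\right) \tag{15.10}$$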
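
As a quick check of this result, consider a free-particle plane wave. The worked steps below assume the current has the form $\mathbf{J} = (i\hbar/2m)(\psi\nabla\psi^* - \psi^*\nabla\psi)$ and take $\psi = Ae^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)}$; the symbols $A$, $\mathbf{k}$, and $\omega$ are introduced here only for illustration.

```latex
\begin{align*}
\nabla\psi &= i\mathbf{k}\,\psi,
\qquad
\nabla\psi^* = -i\mathbf{k}\,\psi^* \\
\mathbf{J} &= \frac{i\hbar}{2m}\left(-i\mathbf{k}\,\psi\psi^* - i\mathbf{k}\,\psi^*\psi\right)
            = \frac{\hbar\mathbf{k}}{m}\,|A|^2
\end{align*}
```

Since $\hbar\mathbf{k}/m = \mathbf{p}/m$ is the classical velocity, the current has the form $\mathbf{J} = \rho\mathbf{v}$ with $\rho = |A|^2$, just as in the fluid analogy of Equation (15.6).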


Taking the same expression for $\mathbf{J}$ as given in Equation (15.10), but using the
Klein-Gordon equation instead of the Schrödinger equation, we can derive an
expression for $\rho$ that satisfies the continuity equation (see Exercise 15.4):

$$\rho = \frac{i\hbar}{2mc^2}\left(\phi^*\frac{\partial\phi}{\partial t} - \phi\frac{\partial\phi^*}{\partial t}\right) \tag{15.11}$$

Note that this expression for $\rho$ is quite different from our familiar expression
$\rho = \phi^*\phi$. In particular, the expression for $\rho$ given by Equation (15.11) can be negative,
which is clearly a “bad thing.” Note further that in using Equation (15.2), we have
inadvertently introduced negative energy solutions! Equation (15.2) corresponds
to both $E = \sqrt{p^2c^2 + m^2c^4}$ and $E = -\sqrt{p^2c^2 + m^2c^4}$.


Example 15.2. Solution to the Klein-Gordon Equation for a Particle at Rest 
Consider a particle at rest for which $\mathbf{p} = 0$. We will solve the Klein-Gordon equation
for this particle.

Since the momentum is zero, we have $-i\hbar\nabla\phi = 0$, and Equation (15.5) becomes

$$\frac{1}{c^2}\frac{\partial^2\phi}{\partial t^2} + \frac{m^2c^2}{\hbar^2}\phi = 0$$

The most general solution to this equation is

$$\phi = A_1e^{imc^2t/\hbar} + A_2e^{-imc^2t/\hbar} \tag{15.12}$$

where $A_1$ and $A_2$ are unknown constants. Applying the energy operator $i\hbar(\partial/\partial t)$
to each term in Equation (15.12), we see that the first term corresponds to a state
with negative energy, $E = -mc^2$, while the second term corresponds to a state
with positive energy, $E = mc^2$.
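
The two terms of this solution also show how the Klein-Gordon density can change sign. The worked steps below assume that Equation (15.11) has the standard form $\rho = (i\hbar/2mc^2)(\phi^*\,\partial\phi/\partial t - \phi\,\partial\phi^*/\partial t)$ and evaluate it for each term separately. The labels $\phi_+$ and $\phi_-$ for the positive- and negative-energy terms are introduced here only for illustration.

```latex
\begin{align*}
\phi_+ = A_2\,e^{-imc^2t/\hbar}: \quad
\rho &= \frac{i\hbar}{2mc^2}\left(-\frac{imc^2}{\hbar} - \frac{imc^2}{\hbar}\right)|A_2|^2
      = +|A_2|^2 \\
\phi_- = A_1\,e^{+imc^2t/\hbar}: \quad
\rho &= \frac{i\hbar}{2mc^2}\left(+\frac{imc^2}{\hbar} + \frac{imc^2}{\hbar}\right)|A_1|^2
      = -|A_1|^2
\end{align*}
```

Under this assumption the positive-energy term carries a positive density and the negative-energy term a negative one, so the sign problem in $\rho$ and the negative-energy solutions are two sides of the same difficulty.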


Because the Klein-Gordon equation contains a second derivative with respect 
to time, two boundary conditions are necessary to determine the solution: both 
$\phi$ and $\partial\phi/\partial t$ must be specified. This is clear, for example, in the solution given
by Equation (15.12); two boundary conditions are necessary to determine the two 
unknown constants. This represents an additional degree of freedom not present in 






the Schrödinger equation. It is reasonable, therefore, to see if it is possible to find an
equation corresponding to relativistic dynamics that contains only first derivatives 
with respect to time; the result will be the Dirac equation. 


15.2 ■ THE DIRAC EQUATION 


Consider what happens if we try to construct an operator equation corresponding 
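to Equation (15.2) but restrict the equation to be first order in all of the operators.
As a first attempt, we write

$$i\hbar\frac{\partial\psi}{\partial t} = \left[-i\hbar c\left(\alpha_1\frac{\partial}{\partial x} + \alpha_2\frac{\partial}{\partial y} + \alpha_3\frac{\partial}{\partial z}\right) + \beta mc^2\right]\psi$$

where $\alpha_1$, $\alpha_2$, $\alpha_3$, and $\beta$ are constants to be determined. In order to find the values of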
these constants, we square the operator on both sides of the equation, and require 
the final result to reduce to the Klein-Gordon equation: 


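$$\begin{aligned}
-\hbar^2\frac{\partial^2\psi}{\partial t^2}
&= \left(-i\hbar c\sum_{j=1}^{3}\alpha_j\nabla_j + \beta mc^2\right)\left(-i\hbar c\sum_{k=1}^{3}\alpha_k\nabla_k + \beta mc^2\right)\psi \\
&= \left(-\hbar^2c^2\sum_{j=1}^{3}\sum_{k=1}^{3}\alpha_j\alpha_k\nabla_j\nabla_k - i\hbar mc^3\sum_{j=1}^{3}(\alpha_j\beta + \beta\alpha_j)\nabla_j + \beta^2m^2c^4\right)\psi
\end{aligned} \tag{15.13}$$

where we define $\nabla_1 = \partial/\partial x$, $\nabla_2 = \partial/\partial y$, and $\nabla_3 = \partial/\partial z$. We want this to reduce
to the Klein-Gordon equation, which can be written as

$$-\hbar^2\frac{\partial^2\psi}{\partial t^2} = -\hbar^2c^2\nabla^2\psi + m^2c^4\psi$$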

In order for Equation (15.13) to reduce to the Klein-Gordon equation, the first
term on the right-hand side of Equation (15.13) must simplify to $-\hbar^2c^2\nabla^2\psi$,
which requires

$$\alpha_j\alpha_j = 1 \tag{15.14}$$

and

$$\alpha_j\alpha_k + \alpha_k\alpha_j = 0, \quad \text{for } j \neq k \tag{15.15}$$

The second term on the right-hand side of Equation (15.13) must vanish, which
gives

$$\beta\alpha_j + \alpha_j\beta = 0 \tag{15.16}$$





Finally, the last term on the right-hand side of Equation (15.13) must reduce to
$m^2c^4\psi$, so that

$$\beta^2 = 1 \tag{15.17}$$

To summarize, each of the four constants $\alpha_1$, $\alpha_2$, $\alpha_3$, and $\beta$ must square to give
1, but the four constants all anticommute with each other ($\alpha_i$ and $\alpha_j$ are said to
anticommute if $\alpha_i\alpha_j = -\alpha_j\alpha_i$). In fact, it is impossible to find four numbers that
satisfy these relations! On the other hand, it is possible to satisfy these relations
if we take $\alpha_1$, $\alpha_2$, $\alpha_3$, and $\beta$ to be matrices. For example, the Pauli spin matrices
from Chapter 8 satisfy exactly the desired relations: $\sigma_x^2 = \sigma_y^2 = \sigma_z^2 = I$, and
$\sigma_x\sigma_y + \sigma_y\sigma_x = 0$, $\sigma_x\sigma_z + \sigma_z\sigma_x = 0$, $\sigma_y\sigma_z + \sigma_z\sigma_y = 0$. The problem is that there
are only three of these matrices and we need four. In order to find four mutually anticommuting
matrices that all square to give the identity matrix, the minimum size
of the matrices must be $4 \times 4$. Thus, the wave function in our differential equation
is no longer a one-component object (as it is in the Schrödinger and Klein-Gordon
equations). Instead, it is a four-component column vector. There are an infinite
number of different choices for matrices satisfying Equations (15.14)-(15.17), but
the choice of which ones to use doesn’t change any physical calculations. The
conventional choice is the following:


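$$\alpha_j = \begin{pmatrix} 0 & \sigma_j \\ \sigma_j & 0 \end{pmatrix}$$

and

$$\beta = \begin{pmatrix} I & 0 \\ 0 & -I \end{pmatrix}$$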


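where each symbol stands for a $2 \times 2$ matrix. Here the $\sigma_j$’s are the $2 \times 2$ Pauli
spin matrices, and $I$ is the $2 \times 2$ identity matrix. For example,

$$\alpha_3 = \begin{pmatrix} 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & -1 \\ 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \end{pmatrix}$$

and

$$\beta = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix}$$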







With these values for $\alpha_j$ and $\beta$, we can go back to our original equation and
write

$$i\hbar\frac{\partial\psi}{\partial t} = \left(-i\hbar c\,\boldsymbol{\alpha}\cdot\nabla + \beta mc^2\right)\psi \tag{15.18}$$

where $\boldsymbol{\alpha}\cdot\nabla$ is shorthand for $\alpha_1(\partial/\partial x) + \alpha_2(\partial/\partial y) + \alpha_3(\partial/\partial z)$; hence, $\boldsymbol{\alpha}$ is a
three-component object whose three components are each $4 \times 4$ matrices! Equation
(15.18) is called the Dirac equation. It forms the basis of relativistic quantum
mechanics, and is perhaps the second most important equation in this book after
the Schrödinger equation itself. Of course, as we have emphasized, the $\psi$ which
appears in Equation (15.18) is really a four-component column vector:

$$\psi = \begin{pmatrix} \psi_1 \\ \psi_2 \\ \psi_3 \\ \psi_4 \end{pmatrix}$$

where each of the four components $\psi_1, \ldots, \psi_4$ is a function of position and time.
If all of the matrices in the Dirac equation are multiplied out, the Dirac equation
breaks into a set of four differential equations relating these four components and
their derivatives (see Exercise 15.6).
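
The algebraic conditions of Equations (15.14)-(15.17) can be checked numerically for the conventional choice of matrices. The Python sketch below assumes the block forms $\alpha_j = \begin{pmatrix} 0 & \sigma_j \\ \sigma_j & 0 \end{pmatrix}$ and $\beta = \begin{pmatrix} I & 0 \\ 0 & -I \end{pmatrix}$, with the Pauli matrices in their standard representation. It uses plain nested lists, and the helper names are illustrative.

```python
def matmul(a, b):
    """Product of two square matrices stored as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def anticommutator(a, b):
    """Return the matrix ab + ba."""
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return [[ab[i][j] + ba[i][j] for j in range(n)] for i in range(n)]


def off_diagonal(s):
    """Build the 4x4 block matrix [[0, s], [s, 0]] from a 2x2 matrix s."""
    return [
        [0, 0, s[0][0], s[0][1]],
        [0, 0, s[1][0], s[1][1]],
        [s[0][0], s[0][1], 0, 0],
        [s[1][0], s[1][1], 0, 0],
    ]


# Pauli matrices in their standard representation (assumed here).
SIGMA_X = [[0, 1], [1, 0]]
SIGMA_Y = [[0, -1j], [1j, 0]]
SIGMA_Z = [[1, 0], [0, -1]]

ALPHA = [off_diagonal(s) for s in (SIGMA_X, SIGMA_Y, SIGMA_Z)]
BETA = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]

IDENTITY = [[1 if i == j else 0 for j in range(4)] for i in range(4)]
ZERO = [[0] * 4 for _ in range(4)]

matrices = ALPHA + [BETA]

# Conditions (15.14) and (15.17): every matrix squares to the identity.
squares_ok = all(matmul(m, m) == IDENTITY for m in matrices)

# Conditions (15.15) and (15.16): every distinct pair anticommutes.
anticommute_ok = all(
    anticommutator(matrices[i], matrices[j]) == ZERO
    for i in range(4) for j in range(i + 1, 4)
)

print("each matrix squares to the identity:", squares_ok)
print("every pair anticommutes:", anticommute_ok)
```

Both checks print `True` for this representation. Any other set of four matrices satisfying the same relations would pass the same test, which is the sense in which the choice of representation is a matter of convention.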

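To find the probability density and probability current for the Dirac wave function,
we begin with the Dirac equation and derive a relation which looks like the
continuity equation. Starting with Equation (15.18), we first take the adjoint (i.e.,
the conjugate transpose) of this equation to obtain

$$-i\hbar\frac{\partial\psi^\dagger}{\partial t} = i\hbar c(\nabla\psi^\dagger)\cdot\boldsymbol{\alpha} + \psi^\dagger\beta mc^2 \tag{15.19}$$

where we have used the fact that all of the $\alpha_j$, $\beta$ matrices are Hermitian. Using
Equations (15.18) and (15.19) for the time derivatives of $\psi$ and $\psi^\dagger$,

$$\begin{aligned}
\frac{\partial(\psi^\dagger\psi)}{\partial t}
&= \psi^\dagger\frac{\partial\psi}{\partial t} + \frac{\partial\psi^\dagger}{\partial t}\psi \\
&= -c\,\psi^\dagger\boldsymbol{\alpha}\cdot\nabla\psi - c(\nabla\psi^\dagger)\cdot\boldsymbol{\alpha}\psi \\
&= -c\nabla\cdot(\psi^\dagger\boldsymbol{\alpha}\psi)
\end{aligned} \tag{15.20}$$

Equation (15.20) looks just like the continuity equation if we take $\rho$ and $\mathbf{J}$ to be
given by

$$\rho = \psi^\dagger\psi$$

and

$$\mathbf{J} = c\psi^\dagger\boldsymbol{\alpha}\psi$$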







These then are the probability density and probability current for the Dirac equation.
Note that, unlike the case for the Klein-Gordon equation, the Dirac probability
density is always nonnegative, since

$$\rho = \psi^\dagger\psi = |\psi_1|^2 + |\psi_2|^2 + |\psi_3|^2 + |\psi_4|^2$$

and each of these four terms is nonnegative. This is a good thing, since probabilities
should be positive.

Now consider the solutions of the Dirac equation for a particle at rest. (The 
solutions of the Dirac equation for nonzero momentum are derived in Exercise 
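15.8.) For a particle at rest, $\mathbf{p} = 0$ so the term $-i\hbar c(\boldsymbol{\alpha}\cdot\nabla)\psi$ is zero, and the Dirac
equation becomes

$$i\hbar\frac{\partial\psi}{\partial t} = \beta mc^2\psi$$

Multiplying out $\beta\psi$ on the right-hand side gives four ordinary differential equations:

$$\begin{aligned}
i\hbar\frac{d\psi_1}{dt} &= mc^2\psi_1 \\
i\hbar\frac{d\psi_2}{dt} &= mc^2\psi_2 \\
i\hbar\frac{d\psi_3}{dt} &= -mc^2\psi_3 \\
i\hbar\frac{d\psi_4}{dt} &= -mc^2\psi_4
\end{aligned}$$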


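Solving for $\psi_1$, $\psi_2$, $\psi_3$, and $\psi_4$ and recombining the solutions back into the form
of a column vector gives four linearly-independent solutions. The first two are

$$\psi = \begin{pmatrix} 1 \\ 0 \\ 0 \\ 0 \end{pmatrix}e^{-imc^2t/\hbar} \tag{15.21}$$

and

$$\psi = \begin{pmatrix} 0 \\ 1 \\ 0 \\ 0 \end{pmatrix}e^{-imc^2t/\hbar} \tag{15.22}$$

The energies corresponding to these two solutions can be determined by applying
the energy operator $i\hbar(\partial/\partial t)$; we obtain

$$E = mc^2$$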


for both solutions. This is precisely the expected result for the energy of a particle 
at rest. But why are there two linearly-independent solutions? These solutions 





apparently describe a two-component object. But we are already familiar with 
such an object: a spin-1/2 particle! The Dirac equation describes an elementary 
particle with spin of 1/2, and the wave function combines both the spatial and 
the spin information into a single quantity called a spinor. Thus, the wave function 
in Equation (15.21) represents a particle at rest with $m_s = +1/2$, and the wave
function in Equation (15.22) represents a particle at rest with $m_s = -1/2$. These
two solutions can be combined linearly to yield any other spin state for a spin-1/2 
particle. 
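The claim that the two rest-frame spinors, and any linear combination of them, solve the rest-frame Dirac equation can be checked numerically. The sketch below is only an illustration under stated assumptions: it uses natural units (hbar = m = c = 1), takes beta to be diagonal with entries (1, 1, -1, -1) as implied by the signs of the four component equations, and approximates the time derivative by a central difference. The function names and the sample spin mixture are hypothetical choices.

```python
import cmath

# Natural units (hbar = m = c = 1) are an assumption made only for this sketch.
HBAR = 1.0
M = 1.0
C = 1.0
# Diagonal of beta, read off from the signs in the four rest-frame equations.
BETA_DIAG = [1, 1, -1, -1]


def rest_solution(u, t):
    """Spinor u * exp(-i m c^2 t / hbar) for a constant four-component u."""
    phase = cmath.exp(-1j * M * C**2 * t / HBAR)
    return [a * phase for a in u]


def residual(u, t, dt=1e-6):
    """Largest component of i*hbar*dpsi/dt - beta*m*c^2*psi (central difference)."""
    ahead = rest_solution(u, t + dt)
    behind = rest_solution(u, t - dt)
    now = rest_solution(u, t)
    worst = 0.0
    for b, plus, minus, psi in zip(BETA_DIAG, ahead, behind, now):
        lhs = 1j * HBAR * (plus - minus) / (2 * dt)
        rhs = b * M * C**2 * psi
        worst = max(worst, abs(lhs - rhs))
    return worst


spin_up = [1, 0, 0, 0]       # constant part of Equation (15.21)
spin_down = [0, 1, 0, 0]     # constant part of Equation (15.22)
mixture = [0.6, 0.8j, 0, 0]  # an arbitrary normalized combination of the two

for name, u in [("spin up", spin_up), ("spin down", spin_down), ("mixture", mixture)]:
    print(name, "residual:", residual(u, t=0.3))
```

All three residuals come out at the level of finite-difference round-off, consistent with both solutions sharing the same energy and therefore the same time dependence.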

The other two linearly-independent solutions are

$$\psi = \begin{pmatrix} 0 \\ 0 \\ 1 \\ 0 \end{pmatrix} e^{imc^2 t/\hbar}$$

and

$$\psi = \begin{pmatrix} 0 \\ 0 \\ 0 \\ 1 \end{pmatrix} e^{imc^2 t/\hbar}$$

which both have negative energy:

$$E = -mc^2$$

It is tempting to dismiss these as “spurious” solutions with no physical significance,
but Dirac did not do so. Rather, he assumed that these solutions were also
valid. If this is the case, where are the negative-energy electrons? Dirac argued
that in an ordinary vacuum, all of the positive-energy states are empty, while the
negative-energy states are filled by a sea of electrons (Figure 15.1). If an electron
is removed from a negative-energy state and boosted into a positive-energy state,
it leaves behind a “hole” in the negative-energy states. The energy of this “hole” is
the negative of the corresponding negative-energy electron state, so the “hole” has
energy $E = -(-mc^2) = mc^2$. In this way Dirac postulated the existence of antimatter.
The holes in the negative-energy states are antielectrons, called positrons.
Note that it is not the electrons in the negative-energy states that correspond to
positrons, but rather the holes in the negative-energy states generated by removing
electrons from them. A positron can annihilate with an electron; this corresponds to
the electron dropping back down and filling the vacant negative-energy state, thus
eliminating both the electron and the hole. This prediction of Dirac was confirmed
experimentally in 1932 with the discovery, by Carl Anderson, of the positron.

This is only a small subset of the important results to which the Dirac equation
leads. It is possible, for example, to show that the Dirac equation predicts that $g_s$
for the spin magnetic moment of the electron should be exactly 2 (a prediction
which must be modified, by a small amount, using the more advanced results of
quantum field theory). The Dirac equation forms one of the main pathways leading
from classical quantum mechanics into our modern theories of particle physics.






FIGURE 15.1 In a vacuum, all of the negative-energy states are filled with electrons, 
and all of the positive-energy states are empty. Moving an electron into a positive-energy 
state leaves behind a “hole” in the negative-energy states, corresponding to a positron. 
Electron-positron annihilation corresponds to the electron dropping back down and filling 
the hole. 
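The energy bookkeeping of the hole picture can be made concrete with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, the electron rest energy is entered as the rounded value 0.511 MeV, and the 2mc^2 figure for lifting an electron from the top of the filled sea to the lowest positive-energy state is an inference from the picture described above for particles at rest, not a number quoted in the text.

```python
# Rounded electron rest energy m c^2 in MeV (an input assumed for this sketch).
ELECTRON_REST_ENERGY_MEV = 0.511


def hole_energy(state_energy):
    """Energy of a hole: the negative of the vacated electron state's energy."""
    return -state_energy


def excitation_energy(initial_energy, final_energy):
    """Energy that must be supplied to move an electron between two states."""
    return final_energy - initial_energy


# Rest-frame states: the highest negative-energy state and the lowest positive one.
top_of_sea = -ELECTRON_REST_ENERGY_MEV
lowest_positive = ELECTRON_REST_ENERGY_MEV

positron_energy = hole_energy(top_of_sea)
pair_creation_cost = excitation_energy(top_of_sea, lowest_positive)

print("hole (positron) energy in MeV:", positron_energy)
print("energy to create the electron-hole pair in MeV:", pair_creation_cost)
```

The cost of creating the pair equals the electron energy plus the hole energy, which is why annihilation, the reverse process, releases the same amount.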


EXERCISES 

15.1 Show that the relativistic relation between energy and momentum (Equation 15.2)
reduces to

$$E = \frac{p^2}{2m} + mc^2$$

for the case when $v \ll c$.

15.2 If $\phi$ is an eigenfunction of both energy and momentum, then another differential
equation corresponding to Equation (15.2) is

$$\frac{1}{c^2}\left(\frac{\partial \phi}{\partial t}\right)^2 - (\nabla \phi)^2 + \frac{m^2 c^2}{\hbar^2}\phi^2 = 0$$

Why is this a less desirable equation than the Klein-Gordon equation?

15.3 (a) If $\mathbf{J}$ is the Schrödinger probability current, show that

$$\int \mathbf{J}\, d^3r = \langle \mathbf{v} \rangle$$

(b) What are the units of $\mathbf{J}$?

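15.4 Using the Klein-Gordon equation, the continuity equation, and the expression for J
from Equation (15.10), derive the Klein-Gordon probability density:

$$\rho = \frac{i\hbar}{2mc^2}\left(\phi^* \frac{\partial \phi}{\partial t} - \phi \frac{\partial \phi^*}{\partial t}\right)$$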


15.5 Write out explicitly the full 4 × 4 matrices corresponding to $\alpha_1$ and $\alpha_2$.






15.6 Multiply out the matrices in the Dirac equation to express the Dirac equation as four
coupled differential equations for the four components of $\psi$: $\psi_1$, $\psi_2$, $\psi_3$, and $\psi_4$.

15.7 Write down the Dirac spinor corresponding to a spin-1/2 particle at rest with spin in
the +x direction and positive energy.

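15.8 (a) The general solution for the Dirac equation can be written in the form

$$\psi = \begin{pmatrix} \phi_1 \\ \phi_2 \\ \chi_1 \\ \chi_2 \end{pmatrix} e^{i(\mathbf{p} \cdot \mathbf{r} - Et)/\hbar}$$

where $\phi_1$, $\phi_2$, $\chi_1$, and $\chi_2$ are numbers independent of r and t. To take advantage
of this form for the Dirac equation, use the shorthand

$$\phi = \begin{pmatrix} \phi_1 \\ \phi_2 \end{pmatrix}$$

and

$$\chi = \begin{pmatrix} \chi_1 \\ \chi_2 \end{pmatrix}$$

Using this form for the solution, show that $\phi$ and $\chi$ satisfy the coupled equations

$$(E - mc^2)\phi = c(\mathbf{p} \cdot \boldsymbol{\sigma})\chi$$

and

$$(E + mc^2)\chi = c(\mathbf{p} \cdot \boldsymbol{\sigma})\phi$$

(b) Use the results from part (a) to show that the general four-component solution of
the Dirac equation may be written as

$$\psi = \begin{pmatrix} \phi \\ c(\mathbf{p} \cdot \boldsymbol{\sigma})\phi/(E + mc^2) \end{pmatrix} e^{i(\mathbf{p} \cdot \mathbf{r} - Et)/\hbar}$$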




Answers and Hints for Selected 
End-of-Chapter Exercises 


For many of the exercises, the answer is given in the exercise itself. Here are 
answers and hints for some of the remaining exercises. All numerical answers are 
given to two significant figures. 

Chapter 1 

1.3 (a) $\rho = 4.0 \times 10^{-14}$ J/m³ (b) $\rho = 2.3 \times 10^{-16}$ J/m³

1.6 Hint: When $h\nu \gg kT$, it is a good approximation to take $e^{h\nu/kT} - 1 \approx e^{h\nu/kT}$

1.8 $\lambda$ = 4800 Å

1.12 (a) $\lambda = 9.0 \times 10^{-14}$ m (b) $\lambda = 4.3 \times 10^{-16}$ m

Chapter 2 

2.5 Hint: Use the polar form for i. 

2.11 (i) linear (ii) linear (iii) not linear


Chapter 3 

3.1 (b) $A = (\sqrt{2}/\pi^{1/4})(km/\hbar^2)^{3/8}$

3.3 (a) No (b) Yes, $E = \hbar\omega/2$.


Chapter 4 

4.4 (a) $\psi(x) = (A/2)(e^{ikx} + e^{-ikx})$ for x < 0; $\psi(x) = A$ for x > 0, where $k = \sqrt{2mE}/\hbar$ (b) R = 1

4.11 (c) $\sqrt{2mV_0 a^2} = \hbar\pi(n + 1/2)$, n = 0, 1, 2, ...

4.14 Hint: Begin with the Schrödinger equation for this potential, but do not try
to solve it. Instead, make a change of variables similar to what we did for the
harmonic oscillator so that the final equation looks like

$$\frac{d^2\psi}{ds^2} + (\lambda - s^p)\psi = 0$$

What does this indicate about the dependence of the energy levels on m?


Chapter 5 

5.4 (b) AB is not Hermitian. 

5.5 (b) V(x) = V(−x), i.e., symmetric potentials




Chapter 6 

6.1 (b) $P = 1/4 - 1/(2\pi)$

6.14 P = 0.73 

6.17 Hint: Write the Hamiltonian in spherical coordinates and calculate the commutator
of $H$ and $L_z$.

6.18 (b) Hint: If the volume of the sphere is increased by a small amount dV, then
the work done by the particle is pdV, where p is the pressure. Conservation of 
energy implies that if the particle does work, its energy must change. 


Chapter 7 
7.5 


1 

n 



7.10 Q = ZHZ 


Chapter 8 
8.5 


8.10 (a) 



1 

0 

1 


0 

1 

0 



0 


) 



0 ov 
0 0 ) 
0 - 1 / 


_ h ( 0 
2 



) 


Eigenvalues are $\hbar/2$ and $-\hbar/2$ corresponding to normalized eigenvectors

and 




respectively. 





(b) $P(m_{sx} = -1/2) = \sin^2(\theta/2)$; $P(m_s = 1/2) = 1/2$

(c) P(m„ = -1/2) = 1/2 

8.14 (a) P = 1/2 (b) $P = \cos^2(\omega t/2)$

Chapter 9 

9.3 (a) Energy length (b) E (1) = Tkja 
(c) 

c2 ) 8mA. 2 ^ 1 2mA. 2 

= ^ I - (2k + I) 2 = 

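9.6 Hint: First show that the potential produced by a uniformly-charged sphere of
charge +e and radius $r_0$ is

$$V(r) = \frac{e^2}{4\pi\epsilon_0 r_0}\left(\frac{r^2}{2r_0^2} - \frac{3}{2}\right), \quad r \le r_0$$

$$V(r) = -\frac{e^2}{4\pi\epsilon_0}\frac{1}{r}, \quad r > r_0$$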


9.13 $E^{(1)} = -(e^2/4\pi\epsilon_0)(1/R)$

Chapter 10 

10.5 A better approximation!

10.7 (a) P estimated = y/3ft(0 

Chapter 11 

11.1 P = (32768/59049)e 2 f^a 2 /[(ft/r) 2 + (E 2 - £ ( ) 2 ] 
11.11 $n'_x = n_x$, $n'_y = n_y$ + an odd positive integer, $n'_z = n_z$


Chapter 12 

12.3 (a) $d\sigma/d\Omega = 4\sin^2[(\mathbf{k}_f - \mathbf{k}_i) \cdot \mathbf{d}/2]\,(d\sigma/d\Omega)_R$

12.4 $d\sigma/d\Omega = (mA/\pi\hbar^2)^2 [\cos(ka \sin\theta \cos\phi) + \cos(ka \sin\theta \sin\phi)]^2$

12.8 Hint: Solve the Schrödinger equation for r > R and r < R, and use the
appropriate boundary conditions to join the solutions at r = R. 

Chapter 13 
13.3 



(*yi> 

l . /7rzi> 

1 . (nx 2 \ . 

(xyi\ 

\ * / 7r ^2\ 

1 -) sin | 

——, 

sin 1-. 

sin (- ) sw 

—— 

| sm (-) 

V a J 

V b J 

’ \ c > 

’ V a / ' 

\ b ) 

\ c / 


n 2 jt 2 n r 

m a 2 b 2 c 2 _ 





13.5 (a) $E^{(1)} = (3/2)(K/a)$

13.11 Allowed L, S, and J values are: $^1G_4$, $^3F_2$, $^3F_4$, $^1D_2$, $^3P_0$, $^3P_1$, $^3P_2$,
$^1S_0$; ground state is $^3F_2$.

Chapter 14 

14.1 Hint: What is the magnetic moment of the oxygen-16 nucleus? 

14.5 (b) 



Chapter 15 

15.3 (a) Hint: Express J in terms of the momentum operator. 



Index 


Addition 

of complex numbers, 23 
of linear operators, 96 
of vectors, 100 
Adiabatic change, 237 
Adjoint operator, 105-106 
Alkali metals, ground state, 295 
α decay, 79
Anderson, Carl, 321 
Angular frequency, 37 
Angular momentum, 117-125 
atomic structure and, 295-296 
classical, 161 

orbital angular momentum, 
121, 159 

spin angular momentum, 121, 
159-190 

total angular momentum, 
164-166, 207, 295 
Angular momentum operator, 
121, 126-127 

Angular momentum quantum 
number, 137 

Anharmonic oscillator, 201-203 
Anomalous magnetic moment, 
162 

Anticommuting, 318 
Antimatter, 321 
Antisymmetric wave function, 
287, 289 

Argon, ground state, 294 
Atomic structure 
of alkali metals, 295 
angular momentum and, 
295-296 
of argon, 294 
Aufbau principle, 294 
of beryllium, 300 
Bohr atom, 17-20, 140 


of boron, 294, 296-297
of carbon, 297-300 
fine structure, 205 
of halogens, 295 
of helium, 230-234, 291-292, 
294, 296 

Hund's rules, 296, 300 
of hydrogen, 133-141, 
204-221, 245-246,
251-252, 294, 295-296
of lithium, 294, 295
of multielectron atoms, 
293-300 
of neon, 294 

perturbations to atomic energy 
levels, 204-213 
of potassium, 295 
Rutherford model, 18 
of sodium, 295 
Aufbau principle, 294 

Balmer, Johann, 18
Balmer series, 18 
Basis sets, 107-110 
Bell, J.S., 189

Beryllium, ground state, 300 
Binary gates, classical 

computing, 308-309 
Blackbody radiation, 2-10 
Bohr atom, 17, 140 
Bohr magneton, 161 
Bohr, Niels, 17, 18
Bohr radius, 138 
Boltzmann distribution, 7 
Boltzmann's constant, 6 
Born, Max, 44, 46, 47


Born approximation, 261-271
defined, 266 
scattering from a 

delta-function potential, 
269-271 

scattering from an infinitely 
hard sphere, 277-278 
scattering from 

three-dimensional 
repulsive spherical well, 
268-269 

Boron, atomic structure, 294, 
296-297 

Bose-Einstein statistics, 285 
Bosons, 160, 285, 288 
Bound states, 63, 79-91, 83-91
Bracket, 153 

Bromine, ground state, 295 

Carbon, ground state, 297-300 
Central potentials, 125, 133 
Chlorine, atomic structure, 295 
Classical harmonic oscillator, 
89-90 

Classical physics 
momentum of particle, 117 
particles, 35 
postulates, 1 

Clebsch-Gordan coefficients,
183 

Column vectors, 166 
Commutators, 97-98, 119 
Complex conjugation, 29-30 
Complex numbers, 23-30 
addition, 23 

complex conjugation, 29-30 
converting to polar form, 
26-27 

division, 24, 27 





